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Preface 


The first edition of The Genetics of the Pig was published more than 12 years ago. During the 
past several years a deep transformation has occurred in all of the biological sciences. Beginning 
in 2001, several mammalian genomes were sequenced. The latest addition to this list is the pig 
genome, whose first draft sequence was obtained at the end of 2009. While the annotation of the 
pig genome is still an ongoing process one can confidently expect that in the near future this 
process will be significantly advanced. These developments in the biological sciences have led to 
the practical implementation of powerful laboratory methods which, together with the emer¬ 
gence of advanced bioinformatics, have caused dramatic accumulations of new data in diverse 
databases. 

Genetic theory and practice have also evolved considerably over the last 12 years. The previ¬ 
ously separate fields of classical and quantitative genetics have now been joined with genomics, 
molecular genetics and immunogenetics, creating new methods and insights into understanding 
numerous biological processes. Today, more than in the past, genetics serves as a key pillar of 
modern agricultural sciences, medicine and the biotechnological industry. Animal breeders and 
geneticists, more widely than ever before, are using knowledge from the different fields of genet¬ 
ics for development and improvement of their livestock. Given these dramatic changes in current 
genetic applications and theory it seems an appropriate time to publish the second edition of 
books devoted to genetics of domestic livestock species. The Genetics of the Pig opens this 
series of books. 

Domestication of the pig occurred some 9000 years ago, and the consequences of this pro¬ 
cess have been tremendously important for the food supply in different civilizations. Today, the 
pig continues to be a valued source of food worldwide. Modern biological discoveries and techno¬ 
logical improvements in management practices have revolutionized pork production. 
Approximately one billion pigs are raised annually worldwide and pork remains the dominant 
meat source, representing over 40% of all the red meat eaten. Owing to its physiological and 
genetic similarities with man, the pig serves as an excellent animal model for biomedical research, 
and as an important source for xenotransplantation and other potential medical applications. 
Numerous animal scientists, geneticists, veterinarians, livestock producers, medical researchers 
and students are interested in the biology and genetics of the pig. This new edition of the book 
brings a wealth of knowledge that we hope will be useful for this diverse group of scientists and 
practitioners around the world. 

The purpose of this book is to present in one location a complete, comprehensive and 
updated description of the modern genetics of the pig. It is our intention to combine essential 
knowledge from the various fields of genetics and biology of the pig, integrated with livestock 
management aspects, in order to provide an updated and informative reference book. The 
genetic improvements in the pig industry over the past couple of decades have been very impres¬ 
sive, with growth rates increasing, feed efficiency improving and a continued rise in overall 
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leanness. These successes have in great part been due to the incredible progress in the under¬ 
standing and application of genetics to pig production. As recently as 1990, only about 50 genes 
and markers were mapped or assigned to individual porcine chromosomes. Now the genome 
draft sequence has been obtained and the completion of very detailed genetic and genomic 
maps has been accomplished. Furthermore, genetic improvement within the pig industry that 
rests on the introduction of gene tests and on genomic selection based on thousands of genes is 
coming. 

This book is addressed to a diverse audience, including students, researchers, veterinarians 
and pig breeders. The initial two chapters are devoted to the taxonomy and domestication of the 
pig. This area has advanced significantly over the last decade. Chapters 3 and 4 extensively cover 
the genetics of coat colour, morphological characteristics and inherited disorders. Molecular 
genetics and immunogenetics are described in Chapters 5 and 6. Cytogenetics, chromosome 
maps and genomics are presented in Chapters 7 and 8. Chapter 9 concentrates on the genetics 
of behaviour, while the next three chapters are relevant to the biology and genetics of reproduc¬ 
tion, modern reproductive technologies and the genetics of development. 

Chapter 13 addresses genetic diversity and concerns for maintaining exotic and rare local 
breeds. The genetics of performance traits and carcass and meat quality traits are discussed in 
Chapters 14 and 15. Chapter 16 is devoted to overall genetic improvement. Chapter 17 examines 
the pig’s contribution and future potential as an important model for biomedical sciences and a 
key species for possible organ donation. Chapters 18 and 19 cover pig breeds and genetic 
nomenclature. 

A significant effort was made to consistently implement the current genetic nomenclature. 
Unfortunately, there are a few shortcomings that seem to be unavoidable. The genetic nomencla¬ 
ture requires that not only genes and alleles, but also traits, should be spelled according to the 
American style. Essentially, just a few words like colour (color), behaviour (behavior) and flavour 
(flavor) cause some difficulties. It is not always absolutely clear whether a word like the above 
describes a trait from the formal point of view and as such the American style should be used. As 
this book is published in the UK the British spelling rules should apply in all other cases. More on 
this nomenclature matter can be found in Chapter 19. 

The considerable and never-ending progress in genetics research makes it impossible to 
cover all new and relevant literature. Therefore, it is inevitable that some publications will not be 
cited. We hope that any errors or omissions will be noted and brought to our attention. Also, 
during the time when this book was being written there were incremental changes in genetic 
nomenclature. Despite the consistent attempts to introduce all these changes into the book, we 
probably cannot claim to have been completely successful. Finally, in no way is this book meant 
to replace the many fine textbooks devoted to the theory of animal breeding. 

This book is the result of international efforts. These efforts are dedicated to our families, to 
supportive colleagues and to the pig industry that employs and feeds hundreds of millions of people 
worldwide. The editors offer a special thank you to each of the authors who contributed chapters 
so graciously and without reservation. Previous authors are also thanked for providing materials 
useful to this second edition. Publication of colour plates was generously supported by PIC (part of 
Genus pic), Hendersonville, Tennessee, USA, and the USDA/CSREES Pig Genome Coordination 
program. Finally, the editors also acknowledge with appreciation all the efforts of CABI to help in 
producing the book. It is our hope that this text will serve as a useful resource for all those people 
who study or work with pigs. 

Max F. Rothschild 
Anatoly Ruvinsky 

November 2010 
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Introduction 

According to the current classification, pigs 
belong to order Cetartiodactyla, which includes 
even-toed ungulates (the former order 
Artiodactyla), and whales and dolphins, which 
are representatives of the former order 
Cetacea (Murphy et a/., 2001; Novacek, 
2001). Cetartiodactyla diverged from placen¬ 
tal mammals approximately 87.2 million 
years ago (mya) (Murphy and Eizirik, 2009). 
There are three well-established suborders: 
(i) Tylopoda - camels and llamas; (ii) Suiformes 
(also known as Suina) - pigs and peccaries 
(and formerly hippos); and (iii) Ruminantia. 


The fourth and more recently created 
suborder - called Cetancodonta - includes hip¬ 
pos, dolphins and whales (Price et a/., 2005; 
O’Leary and Gatesy, 2008; Huffman, 2009). 
Phylogenetic analyses of the cytochrome b 
sequences from 264 of the 290 extant 
Cetartiodactyla show that the Suiformes and 
Ruminantia cluster together as a sister clade of 
Cetancodonta, while the Tylopoda cluster in a 
separate clade (Agnarsson and May-Collado, 
2008). Molecular genetic studies of the 
Cetartiodactyla have shown good agreement 
with the basic structure of their phylogeny as 
supported by morphological data (Novacek, 
1992), but the position of hippos and camels 
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remains contentious (Thewissen et a/., 2007; 
O’Leary and Gatesy, 2008; Spaulding et a/., 
2009). 


Suborder Suiformes 

The Suiformes consist of two living families: 
Tayassuidae (peccaries) and Suidae (pigs). 
Currently Tayassuidae has three genera, each 
with a single species, and all these species live 
in the Americas. The modern Suidae family is 
more diverse and consists of six genera with 
18 or 19 recognized species depending on the 
source of information. Many of these species 
live in South-east Asia, others broadly in 
Eurasia and several species are found only in 
Africa. 


Family Tayassuidae 

The family Tayassuidae (peccaries) diverged from 
pigs in South-east Asia somewhere in the late 
Eocene, or possibly later, and then migrated to 
Eurasia, Africa and North America, finally colo¬ 
nizing South America (Ducrocq, 1994). However, 
the modern species live only in the Americas. The 
peccaries, like pigs, have a snout disc, but the 
differences between the two families are signifi¬ 
cant. The stomach in peccaries is subdivided into 
three compartments. In several features of the 
digestive system, peccaries resemble ruminants; 
it is not known whether these features developed 
independently. Peccaries have three hooves on 
their back legs. In addition, their upper canine 
teeth are pointed down and the total number of 
teeth is 38. A scent-producing gland located on 
their backs is another specific feature. Peccaries 
are significantly smaller than pigs, with an aver¬ 
age body size of approximately 30kg. 

There are three recognized extant peccary 
species that are distributed on the American 
continent: Catagonus wagneri (Chacoan pec¬ 
cary), Tayassu pecari (white-lipped peccary) and 
Pecan tajacu (collared peccary) (Grubb, 1993a). 
The collared peccary spread across South and 
Central America and the southern part of North 
America. It is a very common species, reprodu¬ 
cing well and widely hunted because of its 
good meat and leather quality. The white-lipped 


peccary is bigger than the collared peccary. This 
species is spread from southern Mexico to the 
south of Central America. Hybridization between 
the white-lipped peccary and the collared pec¬ 
cary has been observed in captivity (Sowls, 
1997) and in the wild (Andrea et a/., 2001), with 
the wild hybrid being sterile. The Chacoan pec¬ 
cary was known only through fossil records until 
it was discovered on the Paraguay-Bolivia- 
Argentina border about three decades ago 
(Wetzel, 1977a). This species has been shown to 
differ significantly from the two other peccaries 
in chromosome number (2n = 20) (Benirschke 
et a I., 1985; Benirschke and Kumamoto, 
1989). 

There are conflicting hypotheses about 
the evolution and relationships of modern pec¬ 
cary species. Based on osteological and dental 
traits, collared and Chacoan peccaries were 
considered to be more closely related, while 
the white-lipped peccary was considered to be 
a member of a separate clade, along with other 
extinct species (Wright, 1998). However, other 
morphological studies (Wetzel, 1977b) sug¬ 
gested that collared and white-lipped peccaries 
are more closely related to each other than to 
the Chacoan peccary. In contrast, phylogen¬ 
etic studies using mitochondrial and nuclear 
DNA sequences show that white-lipped and 
Chacoan peccary species are more closely 
related to each other than to the collared pec¬ 
cary (see Fig. 1.1) (Theimer and Keim, 1998; 
Gongora and Moran, 2005). Additional DNA 
studies have suggested that the geographically 
widespread and phenotypically diverse collared 
peccary may consist of at least two separate 
lineages deserving specific status, which are as 
genetically distinct as white-lipped and Chacoan 
peccaries (Gongora et a/., 2006). 


Taxonomy and Phylogeny 
of the Suidae Family 

Systematics 

The Suidae family includes the most widely 
spread species of non-ruminant even-toed ungu¬ 
lates, commonly known as pigs and hogs. All of 
them have an elongated muzzle with a snout 
disc and four-toe extremities with well-developed 
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side toes. The canine teeth are large and the 
upper ones are curved. The stomach is simple 
with an additional sac. The Suidae are omnivo¬ 
rous. This family traces back to the upper Eocene 
of Thailand (-35-40 mya), or possibly later 
(Ducrocq et ai, 1998; Liu, 2003). During the 
Neogene, suids greatly diversified into over 30 
genera, and colonized different parts of Eurasia 
and Africa, where they radiated further (Pickford, 
1993, 2006). The extant family Suidae com¬ 
prises 15 species grouped into several genera: 
Sus (domestic and wild pigs) from Eurasia; 
Porcula (pygmy hogs) from northern India (the 
separation of Porcula from Sus is not finally 
resolved); Babyrousa (babirusa) from the island 
of Sulawesi and its satellite islands; and 
Potamochoerus (bush pig and red river hog), 
Phacochoerus (common and desert warthogs) 
and Hylochoerus (forest hog) from sub-Saharan 
Africa (Grubb, 1993a,b). The origin, evolution¬ 
ary relationships and dispersal patterns of 
Suidae remain contentious. For instance, it has 
been suggested that some modern suids from 
sub-Saharan Africa are more closely related to 
species from Eurasia than to their African 
congeners on the basis of cranial and dental 
similarities (Thenius 1970; Cooke, 1978) and 
cytochrome b sequence (Agnarsson and May- 
Collado, 2008). In contrast, other morphologi¬ 
cal analyses suggest that the modern African 
Suidae could be sister lineages (Harris and 
White, 1979; Bender, 1992; Geraads, 2004). 
This lack of consensus extends to the Asian 
congeners of the Suidae. For instance, it has 
been suggested (on the basis of mitochondrial 
DNA) that Sus saluanius (pygmy hog) deserves 
a separate status from the genus Sus, and no 
one has conclusively demonstrated whether 
Ba by ro u sa from South-east Asia or Phacochoerus 
from Africa occupies a basal position within the 
Suidae (Pickford, 1993; Funk et ai, 2007). 
Some DNA studies have contributed to under¬ 
standing the relationships within Phacochoerus 
and Sus (Randi et ai, 1996, 2002). Recently, 
a concatenated mitochondrial and nuclear DNA 
study by Gongora et ai (submitted) has provided 
new insights into the evolutionary relationships 
of Suidae as a way to resolve conflicting and 
unresolved Suidae topologies generated by 
individual sequences. This study shows that 
all sub-Saharan African genera cluster in a 
monophyletic clade separate from the Eurasian 


Sus species, and confirms that Babyrousa is 
the sister taxon to the other extant species of 
Suidae, when Tayassuidae is used as the 
out-group (Fig. 1.1). Accordingly, Gongora 
et ai (submitted) propose that the five extant 
genera of Suidae should be grouped into the 
subfamilies Babyrousinae and Suinae, with the 
latter consisting of three tribes, the so-called 
‘true’ pigs (Suini), warthogs and the forest hog 
(Phacochoerini) and the bush pig and red river 
hog (Potamochoerini). 

Subfamily Babyrousinae 
Genus Babyrousa 

Species Babyrousa babyrussa (babirusas 
on the Sula Islands and Burn Island) 
Species Babyrousa celebensis (babirusas 
from the northern arm of Sulawesi) 
Species Babyrousa togeanensis (babiru¬ 
sas restricted to the Sulawesi Togean 
islands) 

Subfamily Suinae 

Tribe Phacochoerini 
Genus Phacochoerus 

Species Phacochoerus africanus (com¬ 
mon warthog) 

Species Phacochoerus aethiopicus 
(Cape and Somali warthog) 

Genus Hylochoerus 

Species Hylochoerus meinertzhagen i 
(forest hog) 

Tribe Potamochoerini 
Genus Potamochoerus 

Species Potamochoerus porcus (red 
river hog) 

Species Potamochoerus larvatus 
(bush pig) 

Tribe Suini 
Genus Sus 

Species Sus scrofa (Eurasian wild 
boar) 

Species Sus verrucosus (Javan warty 

pig) 

Species Sus barbatus (bearded pig) 
Species Sus celebensis (Sulawesi 
warty pig) 

Species Sus philippensis (Philippine 
warty pig) 

Species Sus cebifrons (Visayan warty 

pig) 

Species Sus salvanius or Porcula 
salvanius? (pygmy hog) 
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Sus bar bat us (38) 

Sus verrucosus (38) 

Sus philippensis (?) 

Sus cebifrons (34) 

Sus scrota (36, 37, 38) 

Sus cetebensis (38) 

Potamochoerus larvatus (34) 
Potamochoerus porcus (34) 
Hylochoerus meinertzhageni (32) 

Phacochoerus africanus (34) 
Phacochoerus aethiopicus (34) 
Porcula salvanius (38) 

Babyrousa babyrussa (38) 
Babyrousa cetebensis (38) 
Babyrousa togeanensis (38) 

Pecari tajacu (30) 

Catagonus wagneri (20) 

Tayassu pecari (26) 


Fig. 1.1. A cladogram depicting the relationships in the suborder Suiformes. This tree amalgamates 
information from several recent publications that have generated mitochondrial and nuclear DNA 
sequences. Polytomies indicate a lack of resolution with respect to branching order, and the dashed line 
leading to the pygmy hog represents the current uncertainty with regard to its position within the tree. 
Well-supported nodes are marked by asterisks, and diploid chromosome numbers are placed within 
parentheses following the species name. This tree will be revised as newly elevated species are included and 
new genetic and morphological data resolve the branching order. 


The general taxonomy of the family seems 
reasonably justified, although the taxonomy of 
some genera, Sus and Babyrousa in particular, is 
likely to be the subject of future reconsiderations. 
There are indications concerning the existence of 
at least two more species: Sus bucculentus and 
Sus heureni, which have been purported to 
inhabit South Vietnam and the Flores Islands 
(Indonesia), respectively. A recent paper based 
upon mitochondrial signatures, however, demon¬ 
strated that Sus bucculentus is not sufficiently 
different from Sus scrofa to deserve full species 
recognition (Robins et al., 2006). 

Subfamily Babyrousinae 

Genus Babyrousa 

Until recently, there was only one species of 
babirusa (‘babi-rusa’, Indonesian, babi = pig, 


rusa = deer): B. babyrussa. Based upon a 
reconsideration of morphological characters, 
Meijaard and Groves (2002) proposed that this 
single species be split into three extant species, 
B. babyrussa, B. celebensis, B. togeanensis, 
and a fourth extinct species from the south¬ 
west arm of Sulawesi which they call Babyrousa 
bolabatuensis. These designations are likely to 
be altered again when DNA evidence is brought 
to bear on the question. 

The members of this genus differ signifi¬ 
cantly from Suinae in that they have long legs, 
relatively small heads and bodies mostly free 
of hair. The morphology of the stomach is 
complex. Grass comprises the main source of 
food and the typical digging or rooting behav¬ 
iour of pigs is not known. The canine teeth in 
males are considerably more developed and are 
very large and curved, sometimes creating a spi¬ 
ral. The most unusual feature is that the man¬ 
dibular canine and the maxillary canine teeth 
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are both pointed upwards, which is uncommon 
for mammals. Females usually deliver two off¬ 
spring and the adults are large in size. Babirusas 
are spread across Sulawesi and nearby islands, 
and, because they can swim well, whether they 
arrived on nearby islands by themselves or were 
ferried by humans remains an open question. 

Eleven of the autosome pairs and the X 
chromosome in this genus look very similar to 
the chromosomes of the domestic pig (Bosma 
et a/., 1991a; see also Table 7.4). Future 
molecular phylogenetic investigations may give 
a clearer answer concerning the origin of the 
babirusa. A morphological study of the pla¬ 
centa and heart anatomy indicates that the 
babirusa has a significant number of traits in 
common with pigs (MacDonald, 1994). 

Subfamily Suinae 

Tribe Phacochoerini 

genus phacochoerus. The common warthog 
(P. africanus), is widespread in many African 
countries. The name is derived from the large 
warts that are located on the muzzle, whose 
function is not known. The shape of the skull of 
the warthog differs essentially from that of other 
pigs and the number of teeth is greatly reduced. 
The canines are large, sharp and represented 
in both sexes. Adults are approximately 145- 
190 cm in length, 65-85 cm in height and 
weigh 50-150kg. Average litter size is about 
three to four piglets. The offspring are 
susceptible to the cold immediately after birth 
and therefore do not leave their burrow, where 
the temperature is around 30°C constantly. 
Their main food is grass and the animals graze 
on their knees, which causes callus 
development. Adults, in contrast, to their 
offspring, enter the burrow backwards. The 
species is spread widely in sub-Saharan Africa, 
but not in the rainforests of Western Africa. 
In Southern Africa, this species has been 
reintroduced for hunting. The diploid chro¬ 
mosome number of the common warthog is 
34 (Melander and Hansen-Melander, 1980). 

The Cape and Somali warthog, P aethi- 
opicus, was recognized by zoologists as a sepa¬ 
rate species relatively recently (Grubb, 1993b). 
However, palaeontologists came to a similar 
conclusion many years ago, mainly because of 


the lack of functional incisors in P aethiopi- 
cus. The common warthog has two incisors in 
the upper jaw and usually six in the lower jaw. 

genus hylochoerus. The forest hog (H. 
meinertzhageni) is one of the largest wild pigs 
(length 155-190 cm, height up to 110 cm and 
weight up to 250 kg). The forest hogs of East 
Africa are particularly large (Grubb, 1993b). 
The head and muzzle are very broad, and the 
snout is big and well developed for extensive 
digging. This species and genus is a relatively 
recent discovery. It was found in Kenya during 
the early part of the 20th century and is now 
known to occur throughout the tropical forest 
region of Africa. The animals are covered with 
long black hair. The biology of this species is still 
under investigation. 

Tribe Potamochoerini 

genus potamochoerus. P porcus (red river 
hog) is one of the smallest African pigs. This 
species is widely spread in the central and 
southern parts of the continent and shows a 
significant variability in colour and size. The 
animals that live in West Equatorial Africa, for 
instance, are usually very bright and red in 
colour with a white bar on the back, white hairs 
on the muzzle and brushes of long hairs on the 
ears. Males from the majority of habitats are 
characterized by the canine apophyses (located 
between the ears and the nose), which in older 
males look like two small horns directed 
backwards. Length of the body varies from 
100 to 150 cm. The height varies from 55 to 
80 cm and the weight may reach 80 kg 
(Bannikov and Flint, 1989). The number of 
offspring is three to four. The skull of P porcus 
is very much like that of Sus. Groves (1981) 
considered that the skull structure was an 
indication that these genera may be more 
closely related, or both have changed little from 
their more distant common ancestor. However, 
an absence of comparative cytogenetic (Bosma 
et aI., 1991a) and molecular data makes it 
difficult to estimate phylogenetic distance from 
other genera. 

The range of the bush pig (P. larvatus ) is 
mainly to the east and south of that of the river 
hog and it occurs not only in East and South 
Africa, but even in Madagascar. It is also bristly, 
but bristly pelage extends from the head over 
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the whole body and gives the live animal a 
shaggy, crested appearance, which is different 
from that of the river hog. The two species are 
for the most part separated territorially, but in 
some places their areas may overlap (Grubb, 
1993b). An introgression between the species 
is assumed (Kingdon, 1979). The limited 
amount of information about the biology and 
evolution of both of these species needs to be 
rectified. 


Taxonomy of the Genus Sus 

Introductory remarks 

Sus most likely originated in Island South¬ 
east Asia and then migrated into East Asia 
before heading west towards the Near East, 
North Africa and Europe. (Pickford, 1993; 
Larson et a/., 2005). The earliest fossil evi¬ 
dence of Sus (Sus arvernensis) is from the 
Late Miocene (-6-5.3 mya) of Europe, but, as 
there is no direct known ancestor from this 
continent, it is considered that this lineage 
must have arrived by dispersal from Asia (van 
der Made and Moya-Sola, 1989; van der 
Made et al ., 2006). Groves (1981) presented 
a comprehensive analysis of the taxonomy 
and phylogeny of the extant genus Sus based 
on morphological, palaeontological and bio- 
geographical data. Groves and Grubb (1993) 
revised the systematics of the genus. This 
basic analysis and newly published data are 
summarized in the following section. DNA 
and morphological analyses suggest the exist¬ 
ence of three evolutionary groups within Sus 
that could have diverged during the 
Pliocene - S. cebifrons/S. celebensis , S. bar- 
batus and S. verrucosus/S. scrofa , with Sus 
ahoenobarbus as a new species within the 
latter group (Lucchini et al., 2005). This con¬ 
trasts with the three groups (S. verrucosus, 
S. philippensis and S. scrofa) proposed by 
Groves (1981), as well as partially with other 
DNA studies that cluster S. verrucosus/S. 
barbatus as a sister clade of S. scrofa (Randi 
et al., 2002). Morphological studies of extant 
and extinct forms have divided the genus Sus 
into two groups, the primitive ‘scrofic’ and 
the derived ‘verrucosi’ (van der Made and 


Wa 


Moya-Sola, 1989); other dental studies have 
also indicated a separation of S. scrofa from 
the rest of Sus (Genov, 2004). Several stud¬ 
ies (e.g. those of Larson et al., 2005 and 
Mona et al., 2007) have pointed out that cer¬ 
tain DNA sequences widely used to discover 
the relationships between taxa were not suf¬ 
ficient to discriminate between some species 
of Sus. At present, the genus Sus comprises 
seven species. Investigations of this genus 
conducted over the past 170 years have 
taken different approaches, and the number 
of discriminated species has varied from very 
few to a total of 37. Future classifications, 
based on classical and molecular phylogenetic 
data, are likely to alter the current knowledge 
concerning Sus taxonomy and evolution. 


Sus scrofa (Eurasian wild boar) 

The earliest known fossil of S. scrofa in Europe 
is from the Early Pleistocene (-780,000 years 
ago) (van der Made, 1999; Franzen et al., 
2000); that from Asia is also from the Early 
Pleistocene (J. van der Made, personal com¬ 
munication). The modern species exists in four 
forms: domestic livestock, domestic semi-wild, 
feral and wild (Genov, 2004). S. scrofa spreads 
naturally through vast territories, and covers 
most of Europe and Asia. The species was 
introduced into North and South America, 
Australia and Oceania. Domestic pigs are very 
common in the majority of countries world¬ 
wide, except for those that have religious 
restrictions. Several features, including tooth 
and skull morphology, external proportions, 
hair and colour patterns, biochemical and 
molecular polymorphisms, ecology and behav¬ 
iour, reproductive isolation and natural areas, 
are used for discrimination of the many species 
in the genus. 

Pigs are one of the most widespread mam¬ 
malian species and S. scrofa is the primary 
ancestor of domesticated pigs, though other 
species may also have been involved (see 
Chapter 2). S. scrofa is extremely variable in 
the majority of traits studied. The number of 
subspecies is uncertain and depends upon the 
definition of the subspecies. However, it is pos¬ 
sible to discriminate at least 16 more or less 
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distinct subspecies (Groves, 1981; Groves and 
Grubb, 1993): 


S. s. scrofa 


S. s. attila 


S. s. meridionalis 

S. s. algira 
S. s. libica 


S. s. nigripes 

S. s. sibiricus 
S. s. ussuricus 
S. s. moupinensis 

S. s. leucomystax 
S. s. riukiuanus 
S. s. taiuanus 
S. s. davidi 
S. s. cristatus 

S. s. affinis 
S. s. vittatus 


Western, Central and 
parts of Southern 
Europe 

East Europe, northern 
slopes of Caucasus, 
parts of Western 
Siberia, Central and 
Western Asia 
South Spain, Corsica 
and Sardinia 
North-west Africa 
Asia Minor, Middle East, 
southern part of 
Eastern Europe 
Southern Siberia, Central 
Asia 

Eastern Siberia, Mongolia 
Russian Far East, Korea 
Eastern China, South¬ 
east Asia 
Japan 

Ryukyu Islands 
Taiwan 
Western India 
Eastern India, western 
part of Indochina 
Southern India, Sri Lanka 
Malaysia, Southern 
Indonesian Islands 


The areas of these subspecies are close and 
the level of discriminating differences may be 
quite small, involving size, colour, proportions, 
skull characters and, in several cases, chromo¬ 
some numbers. The variation in chromosome 
number is a result of two distinct Robertsonian 
translocations, which were found in the different 
geographical areas of the species (Tikhonov and 
Troshina, 1974; Bosma, 1976). The usual 
number of chromosomes in S. scrofa is 38 
(Bosma et a/., 1995). However, translocation 
I involving chromosomes 16 and 17 and trans¬ 
location II involving chromosomes 15 and 17 
were found in Kyrgyzstani and European boars 
(Tikhonov and Troshina, 1978), and reduce the 
number of chromosomes to 37 in crosses (het- 
erozygotes) and to 36 in homozygotes. 

Adaptations of these animals to different 
food and climatic conditions are dramatic. The 


flexible behaviour of the wild boar is perhaps 
one of the important features providing this 
adaptability. S. scrofa is well adapted to 
Siberian winters, tropical conditions, moun¬ 
tains and semi-deserts. Figs can tolerate 
temperatures from -50°C to +50°C due to 
well-developed thermoregulatory and nest¬ 
building behaviour. Despite being under signifi¬ 
cant human and predator pressure, populations 
of wild boar and feral pigs are very numerous 
in many parts of the world (Choquenot et a/., 

1996). 

Variations in body size are significant 
among subspecies. The largest subspecies are 
S. s. ussuricus (males up to 300 kg) and S. s. 
attila (males up to 275 kg). Generally, mature 
weight is quite variable depending on age, sex, 
food availability, season and habitat. Body and 
head length is about 130-175 cm and height 
ranges up to 100 cm. The smallest forms of 
wild boar are from South-east Asia. Detailed 
descriptions of skulls, and osteometrical stud¬ 
ies, have been published (Groves, 1981; Endo 
et a/., 1994). 

Current palaeontological knowledge 
regarding the evolution of S. scrofa is still 
limited, and fragmented data do not create a 
full-scale picture of the origin and phylogeny 
of the species. It is known that a fully evolved 
S. scrofa lived in the Biharian fauna in Europe 
and replaced the previously existing lineages 
of Sus strozzii, a possible descendant of S. 
arvernensis (Hunermann, 1969; van der 
Made, 1999). The facial shortening that 
occurred in S. celebensis and in S. scrofa 
has been used as a possible argument in 
favour of their common origin from perhaps 
the southern or the south-eastern regions of 
Asia (Groves, 1981). The spread of the two 
above-mentioned Robertsonian transloca¬ 
tions does not contradict this possibility. 
However, it is obvious that an extensive 
molecular genetics study of the problem is 
necessary before any clear conclusions can 
be drawn. Thus far, an analysis of mitochon¬ 
drial DNA from an extensive Old World sam¬ 
pling of Sus samples generally supported the 
subspecies listed above, though the resolu¬ 
tion of these DNA studies was not sufficient 
to conclusively demonstrate the existence 
of all the named subspecies (Larson et a/., 

2005). 
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Sus verrucosus (Javan warty pig) 

This species now lives mainly in Java. Two 
subspecies have been described (Groves and 
Grubb, 1993). The most typical common fea¬ 
ture is three warts on a specific location of the 
muzzle, which are strongly developed in the 
adult males. Colour varies from overall black to 
a pale red. Size also varies from relatively large 
to small. Sexual dimorphism in size is greater 
than in other species. Despite some differences 
from S. s. vittatus and S. celebensis in 
G-banding and the structure of the Y chromo¬ 
some, similarity is significant (Bosma et al., 
1991b). The closeness of S. verrucosus and S. 
s. vittatus is supported by the observation and 
precise description of interspecies hybrids in 
nature (Blouch and Groves, 1990). Several 
morphological features make S. verrucosus 
close to the other South-east Asian species, S. 
barbatus. 


Sus barbatus (bearded pig) 

The common name of S. barbatus - the bearded 
pig - is due to the elongated whiskers around 
the muzzle from the mouth to the ears. A few 
warts on the muzzle are very typical. Mature size 
varies significantly between several subspecies 
and is on average close to that of S. scrofa. The 
length ranges from 100 to 160 cm and the 
weight is approximately 100 kg. Some males 
are much bigger. The bearded pig inhabits the 
Malaysian peninsula, Sumatra, Java, Borneo, 
Palau, Bangka, Palawan and some other islands. 
S. barbatus sometimes migrates, and these 
migrations involve thousand of animals. Fertile 
hybrids with S. scrofa obtained in captivity are 
known (Blouch and Groves, 1990). 


Sus celebensis (Sulawesi warty pig) 

This wild pig from Sulawesi and several other 
islands, including possibly Timor, has been rec¬ 
ognized as a separate species from S. verruco¬ 
sus by Groves (1981). Cytogenetic analysis 
strongly supported this point, though the struc¬ 
ture of the Y chromosome has some differ¬ 
ences from that of S. verrucosus (Bosma et al., 


1991b). Animals are usually black with a few 
white or yellowish hairs intermixed, and they 
have crown tufts of hair. Other colour types 
have been described. The muzzle is short, like 
the Eurasian wild pig, and is of small size. Legs 
are also short, and the pig has small short ears 
with a relatively large head. This species is found 
on Sulawesi and other offshore islands. There are 
indications that S. celebensis was domesti¬ 
cated during the early Holocene and spread as 
far as Roti, a medium-sized island south-west 
of New Guinea, where the pigs are living now 
(Groves, 1981). In other places, they have 
probably been replaced by domestic S. s. vittatus. 
Based upon morphological characters, Groves 
originally claimed that the several forms of wild 
pigs in New Guinea could be a result of hybridi¬ 
zation between S. s. vittatus and S. celebensis 
(Groves, 1981). 

Recent papers based upon genetic evidence 
have suggested a different alternative. First, mito¬ 
chondrial signatures obtained from S. celebensis 
samples on the island of Sulawesi are not mono- 
phyletic. Instead, two clades, one made up of 
samples from the northern arm of the island and 
one made up of southern samples, cluster within 
other S. scrofa, S. barbatus and S. verrucosus 
samples, suggesting not just two species, but that 
each may have arrived on to the island inde¬ 
pendently (Larson etal., 2005, 2007). Secondly, 
pigs from New Guinea possess a completely 
separate signature known as the Pacific Clade, 
and, though this does not rule out hybridization 
with S. celebensis, it strongly suggests that the 
maternal heritage of pigs on New Guinea and 
throughout the Pacific was derived from an 
Asian wild boar endemic to Peninsular South¬ 
east Asia (Larson et al., 2007). 

Sus philippensis (Philippine warty pig) 

According to the latest information, there are 
sufficient arguments to discriminate S. philip¬ 
pensis from S. celebensis and S. barbatus 
(Groves, 1981; Groves and Grubb, 1993). The 
species occurs on several islands of the eastern 
Philippines. The colour is black, sometimes 
with a pale snout band and red-brown patches 
in the mane. This pig is smaller than S. barba¬ 
tus. Further investigations of the species are 
desirable. 
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Sus cebifrons (Visayan warty pig) 

S. cebifrons is a small pig that occurs allopatri- 
cally to S. philippensis on the west-central 
islands of the Philippines (Groves and Grubb, 
1993). Data on the biology of this species are 
very limited. 


The pygmy hog: Sus salvanius 
or Porcula salvanius ? 

This is the smallest pig. The taxonomy is still 
controversial as follows from the discussion 
below. The pygmy hog does not have warts. 
Body and head length is 66-7 lcm in males 
and 55-62 cm in females. The corresponding 
shoulder height is 23-30 cm and 20-22 cm, 
and weight is 9-10 kg and 6-7 kg in males and 
females, respectively (Mallinson, 1977). The 
basic colour is dark brown. Structure of the 
skull differs significantly from that of S. scrofa. 
The number of pairs of teats is three, instead of 
the six pairs typical of other Sus species, and 
the number of piglets born is usually three to 
four. Ears are large and rounded. The tail is 
very short and the inner toes are short com¬ 
pared with other pig species (Groves, 1981). 
The pygmy hog is currently distributed in quite 
a narrow part of northern Assam (India) in the 
long-grass belt. The number of animals in the 
area is very small. This species is, therefore, 
considered to be endangered. The diploid 
chromosome number is 38. Comparative analy¬ 
sis of G-bands shows that the chromosomes of 
the pygmy hog are very similar to those of the 
domestic pig and those of wild S. scrofa, which 
possess 2n = 38 chromosomes (Bosma et al., 
1983). Except for the small size of the body 
there are relatively few characters that may 
serve as diagnostic for discrimination of this 
species from S. scrofa. 

The original description of this species by 
Hodgson in 1847 placed it as the sole species 
within the genus Porcula. This was overturned 
by Groves (1981), who used a series of mor¬ 
phological markers to assign the species as a 
member of the genus Sus. A recent molecular 
study of modern and museum specimens of 
this species, however, demonstrated that 
pygmy hogs possess a degree of genetic differ¬ 


ence and that phylogenetic inference places 
them outside the Sus genus on a phylogenetic 
tree (Funk et al., 2007). Even though the tree 
did not possess enough resolution to confi¬ 
dently ascertain the relationship of Porcula to 
the other genera, its difference from Sus was 
robustly demonstrated. These observations led 
Funk et al. (2007) to suggest that the genus 
Porcula be revived and that Hodgon’s original 
classification was correct, although future 
reconsiderations using additional data may 
overturn this designation again. 


Interrelationships of the species 

in genus Sus 

A high level of morphological similarities 
between all species of the genus is an argu¬ 
ment in favour of their relatively recent origin 
from a common ancestor. Their same chromo¬ 
some number and their high level of homology 
support this conclusion. However, these close 
relationships complicate phylogenetic recon¬ 
struction. A possible phylogeny of the genus 
Sus is presented in Fig. 1.1 based upon an 
amalgamation of trees presented in a series of 
genetics papers. 

It follows from the previous species 
descriptions that, in several cases, different 
Sus species coexist in the same area, yet have 
maintained significant differences in morphol¬ 
ogy, ecology and behaviour. This may be rea¬ 
sonably explained by a reproductive isolation 
that appears to exist between the species which 
may have contact. This is applicable to S. 
scrofa and the pygmy hog in northern India, 
and to S. scrofa and the ‘Indonesian’ species: 
S. barbatus, S. verrucosus and S. celebensis. 
None the less, interspecies hybridization with 
S. scrofa can occur, and this probably indicates 
a limited reproductive isolation (Groves, 1983). 
The production of fertile hybrids between a 
European wild boar and S. barbatus sows has 
been reported (Lotsy, 1922). Hybridization 
between S. scrofa and S. verrucosus has been 
recorded fairly recently in Java (Blouch and 
Groves, 1990). According to Groves (1996, 
personal communication) in some parts of the 
Philippines the indigenous wild pigs (especially 
S. cebifrons) are in danger of being hybridized 
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out of existence by crossing with feral domestic 
pigs. Groves (1981) assumed that the ances¬ 
tors of these species were separated at least in 
the Middle Pleistocene, but this may have hap¬ 
pened much longer ago. 

A hypothetical scenario of Sus evolution 
has been suggested by Groves (1981). He pro¬ 
posed that the S. verrucosus-S. barbatus lin¬ 
eage, which has been present from the beginning 
of the Pliocene in Europe, entered Indonesia 
about 2mya. It appears that these animals 
cohabited with the older S. celebensis lineage. 
S. scrofa may have possibly evolved out of the 
S. celebensis lineages and entered Europe 
about 700,000 years ago, where it replaced S. 
uerru cos us-like pigs. The high level of similar¬ 
ity in chromosome structure of S. scrofa and 
S. celebensis does not contradict this hypoth¬ 
esis. Several independent sets of data support 
a Far East origin of S. scrofa and a steady 
spreading in a westerly direction. The previ¬ 
ously mentioned Robertsonian translocations, 
which are typical for some Siberian, Central 
Asian and European populations, probably 
appeared and became fixed in the populations 
after or during their western movement. 
Numerous investigations have been devoted to 
comparing the geographical distribution of 
alleles for blood group antigens, isoenzymes 
and other proteins, but have not been directly 
used as arguments in resolution of the problem 
of the species origins (Gorelov, 1994). This 
story, generated from morphological and bio¬ 
logical data, has recently been supported by 
DNA studies. The most likely scenario is that 


S. scrofa originated in Island South-east Asia, 
and migrated first across the Kra Isthmus into 
Peninsular East Asia, whereupon Sus radiated 
and diversified across East Asia. From there, 
the species spread west into Central Asia, the 
Near East and finally into Europe and North 
Africa (Larson et al., 2005). 


Conclusions 

The information presented in this chapter gives 
a general overview of the systematic position and 
phylogeny of the wild ancestors of the domestic 
pig. The family Suidae appeared during the evo¬ 
lution of the early Oligocene, some time after 
separation of the suborder Suiformes from other 
Artiodactyla. From a morphological point of 
view, the Suiformes are more primitive and less 
specialized. 

Members of the Suidae have spread widely 
across Africa, Europe and Asia. Sus itself 
appeared in the Lower Pliocene at least 
3-5 mya in Europe and Indonesia. A distribu¬ 
tion of these ancestor species through the 
Indonesian Islands was possibly essential for 
speciation within the genus. 

One of these species, S. scrofa, was 
tremendously successful and spread through 
Asia and Europe, replacing previous species. 
A number of more or less distinctive subspecies 
emerged, and some of them were independ¬ 
ently involved in the domestication process 
that began over 10,000 years ago. 


References 


Agnarsson, I. and May-Collado, L.J. (2008) The phylogeny of Cetartiodactyla: the importance of dense taxon 
sampling, missing data, and the remarkable promise of cytochrome b to provide reliable species-level 
phylogenies. Molecular Phylogenetics and Evolution 48, 964-985. 

Andrea, M.V., Oliveira, C., Rocha, G.T. and Foresti, F. (2001) Cytogenetical and histological studies in testis 
of Tayassu tajacu (cateto), Tayassu pecari (queixada) and a natural interspecific hybrid. Journal of 
Animal Breeding and Genetics 118,125-133. 

Bannikov, A.G. and Flint, V.E. (1989) Order Artiodactyla. In: Sokolov, V.E. (ed.) Life of Animals. Mammals, 
Volume 7. Prosveshenie, Moscow, pp. 426-522 (in Russian). 

Bender, RA. (1992) A reconsideration of the fossil suid Potamochoeroides shawi, from the Makapansgat 
Limeworks, Potgietersrus, Northern Transvaal. Navorsinge van die Nasionale Museum Bloemfontein 
8, 1-67. 

Benirschke, K. and Kumamoto, A.T. (1989) Further studies on the chromosomes of three species of peccary. 
In: Redford, K.FI. and Eisenberg, J.F. (eds) Advances in Neotropical Mammalogy. Sandhill Crane Press, 
Gainesville, Florida, pp. 309-316. 



Systematics and Evolution of the Pig 


11 


Benirschke, K., Kumamoto, A.T. and Merrit, D.A. (1985) Chromosomes of the Chacoan peccary, Catagonus 
wagneri (Rusconi). Journal of Heredity 76, 95-98. 

Blouch, R.A. and Groves, C.P. (1990) Naturally occurring suid hybrid in Java. Zeitschrift fur Saugetierkunde 
55, 270-275. 

Bosma, A.A. (1976) Chromosomal polymorphism and G-banding patterns in the wild boar ( Sus scrofa L.) 
from the Netherlands. Genetica 46, 391-399. 

Bosma, A.A., Oliver, W.L.R. and MacDonald, A.A. (1983) The karyotype, including G- and C-banding patterns, 
of the pigmy hog Sus ( Porcula) silvanius (Suidae, Mammalia). Genetica 61,99-106. 

Bosma, A.A., de Haan, N.A., Blouch, R.A. and MacDonald A.A. (1991a) Comparative cytogenetic stud¬ 
ies in Sus verrucosus, Sus celebensis and Sus scrofa vittatus (Suidae, Mammalia). Genetica 83, 
189-194. 

Bosma, A.A., de Haan, N.A. and MacDonald, A.A. (1991 b) The current status of cytogenetics of the Suidae: 
a review. Fradrich-Jubilaumsband, Zoo Berlin, Bongo 18, 258-272. 

Bosma, A.A., de Haan, N.A., Mellink, C.H.M., Yerle, M. and Zijlstraa, C. (1995) Chromosome homology 
between the domestic pig and the babirusa (family Suidae) elucidated with the use of porcine painting 
probes. Cytogenetics and Cell Genetics 75, 32-35. 

Choquenot, D., Mcllroy, J. and Korn, T. (1996) Managing Vertebrate Pests: Feral Pigs. Bureau of Resource 
Sciences, Australian Publishing Service, Canberra. 

Cooke, H.B.S. (1978) Suid evolution and correlation of African hominid localities: an alternative taxonomy. 
Science 201,460-463. 

Ducrocq, S. (1994) An Eocene peccary from Thailand and the biogeographical origins of the artiodactyl 
family Tayassuidae. Palaeontology 37, 765-779. 

Ducrocq, S., Chaimanee, Y., Suteethourn, V. and Jaeger, J.-J. (1998) The earliest known pig from the upper 
Eocene of Thailand. Paleontology 41, 147-156. 

Endo, H., Kurohmaru, M. and Hayashi, Y. (1994) An osteometrical study of the cranium and mandible of 
Ryukyu wild pig in Iriomote Island. Journal of Veterinary and Medical Science 56, 855-860. 

Franzen, J.L., Gliozzi, E., Jellinek, T., Scholger, R. and Weidenfeller, M. (2000) Die spataltpleistozane 
Fossillagerstatte Dorn-Durkheim 3 und ihre Bedeutung fur die Rekonstruktion der Entwicklung des 
rheinischen FluBsystems [The late early Pleistocene fossil lagerstaette(n?) Dorn-Durkheim 3 and its 
significance for reconstructing the development of the Rhenish fluvial system]. Senckenbergiana 
Lethaea 80, 305-353. 

Funk, S.M., Verma, S.K., Larson, G., Prasad, K., Singh, L., Narayan, G. and Fa, J.E. (2007) The pygmy hog 
is a unique qenus: 19th century taxonomists qot it riqht first time round. Molecular Phyloqenetics and 
Evolution 45, 427-436. 

Genov, P.V. (2004) Craniometric characteristics of the subgenus Sus Linnaeus, 1758 and a systematic 
conclusion. Galemys 16, 9-23. 

Geraads, D. (2004) New skulls of Kolpochoerus phacochoeroides (Suidae: Mammalia) from the Late 
Pliocene of Ahl al Oughlam, Morocco. Palaeontologia Africana 40, 69-83. 

Gongora, J. and Moran, C. (2005) Nuclear and mitochondrial evolutionary analyses of collared, white- 
lipped, and Chacoan peccaries (Tayassuidae). Molecular Phylogenetics and Evolution 34, 181-189. 

Gongora, J., Morales, S., Bernal, J.E. and Moran, C. (2006) Phylogenetic divisions among collared peccaries 
(Pecari tajacu) detected using mitochondrial and nuclear sequences. Molecular Phylogenetics and 
Evolution 41, 1-11. 

Gongora, J., Cuddahee, R.E., Nascimento, F.F., Damayanti, C.S., Palgrave, C.P, Lowden, S., Simond, D., 
White, D.J., Tay, W.T., Ho, S.Y.W., Randi, E., Klingel, H., Rodriguez, C.J., Moran, C. and Larson, G. 
(submitted) Rethinking the evolution of sub-Saharan African suids (Suidae, Artiodactyla). 

Gorelov, I.G. (1994) Biologiya Sibirskogo Kabana [Biology of the Siberian Wild Boar]. Institute of Cytology 
and Genetics, Novosibirsk (in Russian). 

Groves, C. (1981) Ancestor for the pigs: taxonomy and phytogeny of the genus Sus. Technical Bulletin No. 
3. Department of Prehistory, Research School of Pacific Studies, Australian National University, 
Canberra. 

Groves, C.P. (1983) Pigs east of the Wallace line. Journal de la Societe des Oceanistes 39, 105-119. 

Groves, C.P. and Grubb, P. (1993) The Eurasian suids: Sus and Babyrousa. In: Oliver, W.L.R. (ed.) Pigs, 
Peccaries and Hippos. International Union for Conservation of Nature (IUCN), Gland, Switzerland, 
pp. 107-111. 

Grubb, P. (1993a) Order Artiodactyla. In: Wilson, D.E. and Reeder, D.M. (eds) Mammalian Species of the World: 
a taxonomic and geographic reference. Smithsonian Institution Press, Washington, DC, pp. 374-414. 



12 


A. Ruvinsky et al. 


Grubb, R (1993b) The Afrotropical suids: Phacochoerus, Hylochoerus, and Potamochoerus. In: Oliver, 
W.L.R. (ed.) Pigs, Peccaries and Hippos. International Union for Conservation of Nature (IUCN), 
Gland, Switzerland, pp. 66-75. 

Harris, J.M. and White, T.D. (1979) Evolution of Plio-Pleistocene African Suidae. Transactions of the American 
Philosophical Society 69, 1-128. 

Huffman, B. (2009) Your guide to the world’s hoofed mammals. Available at: http://www.ultimateungulate. 
com (accessed 25 November 2009). 

Hunermann, K.A. (1969) Sus scrota priscus Goldfuss im Pleistozan von Sussenborn bei Weimar [Sus 
scrota priscus Goldfuss in the Pleistocene of Sussenborn by Weimar]. Palaontologische Abhandlungen 
A3, 611-616. 

Kingdon, J. (1979) East African Mammals: An Atlas of Evolution in Africa, Volume III (B) Large Mammals. 
Academic Press, London. 

Larson, G., Dobney, K., Albarella, U., Fang, M., Matisoo-Smith, E., Robins, J., Lowden, S., Finlayson, H., 
Brand, T., Willerslev, E., Rowley-Conwy, P, Andersson, L. and Cooper, A. (2005) Worldwide phyloge- 
ography of wild boar reveals multiple centers of pig domestication. Science 307, 1618-1621. 

Larson, G., Cucchi, T., Fujita, M., Matisoo-Smith, E., Robins, J., Anderson, A., Rolett, B., Spriggs, M., 
Dolman, D.,Tae-Hun Kim, T.-H., Thuy, N.T.F., Randi, E., Doherty, M., Due, R.A., Bollt, R., Djubiantono, 
T., Griffin, B., Intoh, M., Keane, E., Kirch, P, Li, K.-T., Morwood, M., Pedrina, L.M., Piper, P.J., Rabett, 
R.J., Shooter, R, van den Bergh, G., West, E., Wickler, S., Yuan, J., Cooper, A. and Dobney, K. (2007) 
Phylogeny and ancient DNA of Sus provide new insights into Neolithic expansion in Island Southeast 
Asia and Oceania. Proceedings of the National Academy of Sciences of the USA 104, 4834-4839. 

Liu, L.-P. (2003) Chinese Fossil Suoidea Systematics, Evolution, and Paleoecology. Helsinki University 
Printing House, Helsinki. 

Lotsy, J.P (1922) Die Aufarbeitung des Kuhn’schen Kreuzungs materials im Institut fur Tierzucht der 
Universitat Halle. [The processing of the Kuhn’schen Kreuzungs materials in the Institute of 
Stockbreeding of the University of Halle.] Genetica 4, 32-61. 

Lucchini, V., Meijaard, E., Diong, C.H., Groves, C.P and Randi, E. (2005) New phylogenetic perspectives 
among species of South-east Asian wild pig ( Sus sp.) based on mtDNA sequences and morphometric 
data. Journal of Zoology 266, 25-35. 

MacDonald, A.A. (1994) The placenta and cardiac foramen oval of the babirusa (Babyrousa babyrussa). 
Anatomy and Embryology 190, 489-494. 

Mallinson, J.J.C. (1977) Breeding of the pigmy hog Sus salvanius (Hodgson) in northern Assam. Journal of 
the Bombay National History Society 74, 288-289. 

Meijaard, E. and Groves, C.P. (2002) Upgrading three subspecies of babirusa (Babyrousa sp.) to full species 
level. Asian Wild Pig News 2, 33-39. 

Melander, Y. and Hansen-Melander, E. (1980) Chromosome studies in African wild pigs (Suidae, Mammalia). 
Hereditas 92, 209-216. 

Mona, S., Randi, E. and Tommaseo-Ponzetta, M. (2007) Evolutionary history of the genus Sus inferred 
from cytochrome b sequences. Molecular Phylogenetics and Evolution 45, 757-762. 

Murphy, W.J. and Eizirik, E. (2009) Placental mammals (Eutheria). In: Blair Hedges, S. and Kumar, S. (eds) 
The Timetree of Life. Oxford University Press, Oxford, UK, pp. 471-474. 

Murphy, W.J., Eizirik, E., Johnson, W.E., Zhang, Y.P, Ryder, O.A. and O’Brien, S.J. (2001) Molecular phylo¬ 
genetics and the origins of placental mammals. Nature 409, 614-618. 

Novacek, M.J. (1992) Mammalian phylogeny: shaking the tree. Nature 356, 121-125. 

Novacek, M.J. (2001) Mammalian phylogeny: genes and supertrees. Current Biology W, R573-R575. 

O’Leary, M.A. and Gatesy, J. (2008) Impact of increased character sampling on the phylogeny of 
Cetartiodactyla (Mammalia): combined analysis including fossils. Cladistics 24, 397-442. 

Pickford, M. (1993) Old World suoid systematics, phylogeny, biogeography and biostratigraphy. Paleontologia 
i Evolucio 26-27, 237-269. 

Pickford, M. (2006) Synopsis of the biochronology of African Neogene and Quaternary Suiformes. 
Transactions of the Royal Society of South Africa 1 6 , 51-62. 

Price, S.A., Bininda-Emonds, O.R.P. and Gittleman, J.L. (2005) A complete phylogeny of the whales, dol¬ 
phins and even-toed hoofed mammals (Cetartiodactyla). Biological Review 80, 445-473. 

Randi, E., Lucchini, V. and Diong, C.H. (1996) Evolutionary genetics of the Suiformes as reconstructed 
using mtDNA sequencing. Journal of Mammalian Evolution 3, 163-194. 

Randi, E., D’Huart, J.-P, Lucchini, V. and Aman, R. (2002) Evidence of two genetically deeply divergent species 
of warthog, Phacochoerus africanus and P. aethiopicus (Artiodactyla: Suiformes) in Africa. Mammalian 
Biology 67, 91-96. 



Systematics and Evolution of the Pig 


13 


Robins, J.H., Ross, H.A., Allen, M.S. and Matisoo-Smith, E. (2006) Taxonomy: Sus bucculentus revisited. 
Nature 440, E7. 

Sowls, LK. (1997) Javelinas and Other Peccaries: Their Biology, Management, and Use, 2nd edn. Texas 
A&M University Press, College Station, Texas. 

Spaulding, M., O’Leary, M.A. and Gatesy, J. (2009) Relationships of Cetacea (Artiodactyla) among mammals: 
increased taxon sampling alters interpretations of key fossils and character evolution. PLoS One 4, 
e7062. 

Theimer, T.C. and Keim, P. (1998) Phylogenetic relationships of the peccaries based on cytochrome b DNA 
sequences. Journal of Mammalogy 79, 566-572. 

Thenius, E. (1970) Zur Evolution und Verbreitungsgeschichte der Suidae (Artiodactyla, Mammalia) [The evo¬ 
lution and distribution history of the Suidae (Artiodactyla, Mammalia)]. Zeitschrift fur Saugettirkunde 35, 
321-342. 

Thewissen, J.G.M., Cooper, L.N., Clementz, M.T., Bajpai, S. and Tiwari, B.N. (2007) Whales originated 
from aquatic artiodactyls in the Eocene epoch of India. Nature 450, 1190-1195. 

Tikhonov, V.N. and Troshina, A.I. (1974) Identification of chromosomes and their aberrations in karyotypes 
of subspecies of Sus scrofa L. by differential staining. Doklady Akademii Nauk SSSR 214, 932-935. 

Tikhonov, V.N. and Troshina, A.I. (1978) Production of hybrids of wild and domestic pigs with two chromo¬ 
somal translocations. Doklady Akademii Nauk SSSR 239, 713-716. 

van der Made, J. (1999) Ungulates from Atapuerca TD6. Journal of Human Evolution 37, 389-413. 

van der Made, J. and Moya-Sola, S. (1989) European Suinae (Artiodactyla) from the Late Miocene onwards. 
Bolletino della Societa Paleontologica Italiana 28, 239-339. 

van der Made, J., Morales, J. and Montoya, P. (2006) Late Miocene turnover in the Spanish mammal record 
in relation to palaeoclimate and the Messinian Salinity Crisis. Palaeogeography, Palaeoclimatology, 
Palaeoecology 238, 228-246. 

Wetzel, R.M. (1977a) The Chacoan peccary Catagonus wagneri (Rusconi). Bulletin of the Carnegie 
Museum of Natural History 3, 1 -36. 

Wetzel, R.M. (1977b) The extinction of peccaries and a new case of survival. Annals of the New York 
Academy of Sciences 288, 538-544. 

Wright, D.B. (1998) Tayassuidae. In: Janis, C.M., Scott, K.M. and Jacobs, L.L. (eds) Evolution of Tertiary 
Mammals of North America. Cambridge University Press, Cambridge, UK, pp. 389-400. 



2 Genetic Aspects of Pig 

Domestication 

_ J 


Greger Larson , 1 Thomas Cucchi 2 - 3 and Keith Dobney 3 

1 Durham University, UK; 2 Archeozoologie, Archeobotanique: Societes, Pratiques et 

Environnements’, France ; 3 University of Aberdeen, UK 


Introduction 14 

The Process of Domestication 15 

Pig Domestication Across the Old World 16 

A historical perspective 16 

A general perspective 17 

The Near East 20 

Europe 21 

China 23 

Peninsular South-east Asia, Island South-east Asia and Oceania 27 

India 30 

North Africa 31 

Future Directions and Conclusions 31 

References 33 


Introduction 

Members of the Suidae family include five gen¬ 
era and as many as 15 different species, 
although the taxonomy of this group remains 
uncertain (see Chapter 1). Within this family, 
the genus Sus is represented by at least six spe¬ 
cies, the most geographically diverse of which 
is Sus scrofa. This species occupies a large 
number of ecological niches across the entirety 
of the Old World, and, though claims for the 
domestication of several species from this 
genus have been made (e.g. Groves, 1981), it 
is still generally accepted that only S. scrofa 
has been fully domesticated. It is certainly the 
case from morphological, behavioural and 
(most recently) genetic studies that almost every 
domesticated pig in the world today derives 
from a wild S. scrofa ancestor. Having said 
that, it is important to note that wild boar have 
diversified into numerous regionally distinct 

14 


populations (sometimes classified as subspe¬ 
cies), and that several of these are likely to 
either have been independently domesticated, 
or have contributed genes to modern domestic 
pigs through hybridization. 

The purpose of this chapter is to describe 
each of these populations in turn, the genetic evi¬ 
dence for domestication and the subsequent 
migration of populations of domestic pigs with 
people along migratory routes across the Old 
World. Though modern pig breeds are direct 
descendants of the earliest domestic pigs, the his¬ 
tory of the most widespread commercial breeds 
and their differentiation from one another extends 
only 200 years into the past, a small fraction of 
the -10,000 years since the first appearance of 
domestic pigs in the archaeological record. The 
historical development of modern breeds is inter¬ 
esting and has resulted in a tremendous degree of 
variation (see Chapter 18). This chapter, how¬ 
ever, focuses instead on the deeper time of early 
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domestication in order to establish the origins of 
modern pigs. 

Before doing this it is worth discussing the 
process of domestication itself. One of the 
main questions people ask about animal 
domestication is: ‘Why and when did humans 
first domesticate them?’ This question is based 
upon the premise that domestication was a 
goal oriented, intentional act carried out by 
people who saw in wild populations the poten¬ 
tial of having smaller, more docile versions 
under their direct care. All of the available evi¬ 
dence suggests that this was the least likely 
scenario, and that the process was neither 
intentional nor rapid. Thus, before proceeding 
to the primary purpose of this chapter, we will 
explore what the terms ‘domestication’ and 
‘domesticated animal’ mean, and how we can 
possibly hope to recognize the various stages 
of the process both in the archaeological record 
and in the genomes of modern domestic pigs. 

The Process of Domestication 

In his review of the role of animal behaviour in 
domestication, Price (1984, p. 3) rightly states 
that ‘it is difficult to formulate a definition of 
domestication that is general enough to account 
for the wide variation observed in different spe¬ 
cies, in different captive environments, yet spe¬ 
cific enough to be meaningful in terms of the 
biological processes involved’. Nevertheless, 
he proceeds to attempt to define domestica¬ 
tion as ‘that process by which a population of 
animals becomes adapted to man and to the 
captive environment by some combination of 
genetic changes occurring over generations, 
and environmentally induced developmental 
events reoccurring during each generation’. In 
a nutshell, Price’s definition highlights ‘captivity’ 
(i.e. direct human control) as a basic catalyst 
for the process. 

Though the term ‘domestic animal’ car¬ 
ries universal meaning, the terminology typi¬ 
cally employed by those studying these animals 
is often confusing and poorly defined. The pri¬ 
mary reason for this stems from the inherent 
difficulty of assigning static terms to what is 
clearly a process likely to involve long-term and 
continuous change (Dobney and Larson, 


2006). The terms ‘wild’ and ‘domestic’ have 
long been (and still largely are) terms that have 
been used to describe simple states of being, 
rather than what they actually represent - the 
extremes of a principally biological process 
driven by selection pressures, some of which 
are intimately linked with various and diverse 
aspects of human culture. As a result of this 
somewhat more complex view of human- 
animal relationships, a number of ‘intermediate’ 
stages of domestication have been proposed, 
e.g. ‘cultural control’ (Hecker, 1982; Hongo 
and Meadow, 1998, 2000), ‘pre-domestic’ 
(Vigne and Buitenhuis, 1999) and ‘intermedi¬ 
ary stage’ (Ervynck et a/., 2002). 

A vast range of human-animal relation¬ 
ships has existed throughout history and the 
animals involved in many of these relationships 
cannot be easily categorized as strictly wild or 
strictly domestic. Nor is it credible that the 
range of species involved in these relationships 
took the same trajectory along the path that 
led to ‘complete’ domestication. The fluid 
nature of these relationships has led some 
authors to question whether the term ‘domes¬ 
tication’ (at least as it is traditionally perceived) 
can even be applied to some species. The 
unique biology and behaviour of, for example, 
pigs present special challenges to the study of 
their domestication that have caused some to 
question whether the threshold we term 
‘domestication’ is really relevant to them 
(Jarman, 1976; Zvelebil, 1995). 

Thus, any paradigm that relies on a strict 
wild/domestic dichotomy prevents a deeper 
appreciation of those animals whose lives are 
spent somewhere in between. More impor¬ 
tantly, because this sort of dichotomous per¬ 
spective rules out long-term evolutionary 
change as an explanation of the process of 
domestication, it therefore (however uninten¬ 
tionally) both obscures the existence of transi¬ 
tional forms and prevents any real understanding 
of the domestication process itself (Dobney 
and Larson, 2006). 

It is perhaps best to model the process of 
domestication as a continuous trajectory divided 
into a series of diffuse stages during which dif¬ 
fering selection pressures vary in intensity, sub¬ 
sequently allowing separate selective pressures 
to begin asserting themselves. Understanding 
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other naturally occurring and mutually benefi¬ 
cial/altering relationships that exist between 
other organisms (symbiotic, mutualistic, com¬ 
mensal, or even parasitic) is perhaps more use¬ 
ful in understanding the initial stage of this 
process, one probably not readily visible in the 
archaeological record and in which humans are 
likely to have played little or no direct role. 
Zeder (2006) has taken this idea a step further 
and envisioned the process as an evolving 
mutualism between populations of people and 
populations of plants and animals. At its most 
extreme, this idea suggests that humans, plants 
and animals have all evolved with a biological 
and/or cultural dependence on one another to 
the point where each is necessary for the oth¬ 
er’s survival. Direct analogies can also be found 
in the plant world, as has been demonstrated 
recently by Fuller et al. (2010). 

This dependency began with an initial 
phase of domestication, during which the rela¬ 
tionship between humans and, for example, 
wild boar began to grow in intensity, and is 
likely to have predisposed them to eventual full 
domestication. As tamer wild boar ventured 
closer to human settlements (probably attracted 
by new scavenging opportunities created by 
human settlements in the form of waste and 
refuse), physiological, phenotypic and even 
genetic changes perhaps would already have 
begun. However, it was not until humans pur¬ 
posefully enhanced the selection pressure upon 
that behaviour (by captivity and control over 
reproduction, in turn leading to the enhance¬ 
ment and exaggeration of behavioural and 
phenotypic changes) that true pigs would 
become permanent members of human settle¬ 
ments and more readily recognizable (to our 
modern eyes) as domestic pigs. These pigs 
would have been significantly different from 
their wild progenitors at all levels of biological 
organization, from their genomes to their 
bones. Until recently, accessing the genetic 
evidence of the archaeologically preserved 
bones and teeth of long-dead organisms was 
not possible, and certainly not as well estab¬ 
lished as the zooarchaeological methods used 
to interpret the bones themselves. 

Because domestication (like speciation) is a 
continuum, deriving a specific definition that 
differentiates wild and domestic animals is 
necessarily futile, but that fact makes it more 


difficult to recognize domestic animals in the 
zooarchaeological record. Animals at various 
points along the cline from wild to domestic 
may change their biology, behaviour and atti¬ 
tude towards humans in many different ways 
and to varying degrees, and it therefore follows 
that changes along the continuum cannot be 
analysed in an unequivocal manner. Instead, a 
more reasoned approach becomes necessary, 
one that employs a diversity of approaches and 
contextual arguments. In addition to the com¬ 
plexity of the domestication process per se, fac¬ 
tors that provide the context for domestication 
(e.g. related climatic, environmental, geograph¬ 
ical, chronological and cultural variables) must 
also be considered (Albarella et a/., 2006a). 

Any real understanding of domestication 
must initially be predicated on an appreciation of 
domestication as a process. Once it is understood 
first that wild and domestic are the terminal ends 
of a complex continuum, and secondly that 
plants and animals have played just as active a 
role as people in the process, then we can begin 
to answer the fundamental questions regarding 
when and how our modern world was populated 
by larger populations of domestic animals than 
people. Lastly, it is also worth recognizing the 
value of an approach that combines methods 
from a wide variety of academic disciplines, 
including archaeology and genetics. By adding 
multiple contexts, the understanding of domesti¬ 
cation in general, and of pig domestication spe¬ 
cifically, can only benefit. 


Pig Domestication Across 
the Old World 

A historical perspective 

The distribution of pigs and their close relatives 
across Europe, North Africa and Asia (Groves, 
1981; Oliver et al ., 1993) means that, unlike 
other domestic animals whose wild ancestors 
are more regionally restricted (e.g. sheep and 
goats), a simple diagnosis of pig domestication 
based upon geography is impossible. The very 
earliest excavated and studied assemblages of 
animal bones from various European sites such 
as Danish shell middens, Swiss lake dwellings, 
Italian terramare settlements and others 
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included teeth and bones of pigs, and by the 
end of the 19th century it was recognized that 
at least two main forms, interpreted as wild 
and domestic pigs, were represented by the 
remains (Albarella et a/., 2006a). 

In his early attempts to distinguish between 
the two, Winge (1990) believed that prehistoric 
European domestic pigs (which, following earlier 
authors, he termed S. s. domesticus) were 
descendants of the wild boar of Europe, north¬ 
ern Asia and North Africa (S. s. ferus), while he 
thought modern domestic pigs in South-east 
Asia were derived from local wild boar that he 
suspected might be a different species (Sus uit- 
tatus). Before and immediately following the 
Second World War, work in the Near East began 
revealing very early evidence for agriculture 
stretching back to the beginning of the Holocene 
(-10,000 years ago). Along with sheep, goats 
and cattle, pigs appeared to be an important 
early domesticate, present on some of these 
early farming (Neolithic) sites (Flannery, 1983). 
As a result of these new data, the dominant view 
of the mid-20th century was that pigs (along 
with sheep, goats, cattle and domesticated cere¬ 
als such as barley and wheat) were domesticated 
in the Near East and brought to Europe by immi¬ 
grant farmers (Childe, 1958). A few, like Sauer 
(1952), preferred a South-east Asian origin for 
domestic pigs, but so little archaeological work 
had been done in this region that there was not 
enough evidence to support such a claim. 

Although the majority of researchers 
believed in a limited geographical origin for 
most animal domestication - in the case of 
pigs, just the Near and Far East, later spread¬ 
ing west and east with Neolithic farmers - this 
hypothesis was challenged, and even Winge’s 
metrical separation of wild boar and domestic 
pigs was questioned (Higgs and Jarman, 1969). 
The picture was then further complicated by a 
case made for ‘intermediate’ or ‘semi-domestic’ 
pigs under extensive human control, suggesting 
that closer relationships between geographi¬ 
cally widely distributed wild and the semi¬ 
domestic pigs could have occurred anywhere 
within the larger distribution, not just in the 
previously recognized domestication centres 
(e.g. Jarman, 1976; Zvelebil, 1995). 

Within the last two decades, two compet¬ 
ing models for the history of pig domestication 
have emerged: 


1. Fig domestication took place in the early 
Holocene, during the early Neolithic (see 
Table 2.1) in a few geographically isolated 
locations (specifically the Near East and prob¬ 
ably China) and the descendants of those early 
domesticated individuals were later dispersed 
with the spread of early farmers and stock- 
herders (e.g. Rowley-Conwy, 2003). 

2 . Pig domestication took place within several 
additional (and independent) geographical 
regions outside the Near East and China and 
included Neolithic Europe (e.g. Zvelebil, 1995; 
Albarella et a/., 2006a) and even the Jomon 
period in Japan (Nishimoto, 2003). 

With the advent and application of new 
biomolecular and morphological techniques to 
both modern pig material and historical and 
archaeological remains, our understanding of 
both the geographical and temporal context of 
pig domestication and subsequent trajectories of 
livestock dispersal can now be refined, and in 
some cases radically altered. The following is a 
summary of the genetic and archaeological evi¬ 
dence for pig domestication as it currently 
stands. Figure 2.1 depicts nine possible pig 
domestication centres, which are discussed 
below. 


A general perspective 

Recent genetic research of modern extant Sus 
species has provided evidence for the second of 
the two hypotheses listed above. Mitochondrial 
sequences extracted from hair and blood sam¬ 
ples of recent European and Asian wild boar 
and various breeds of domestic pig revealed a 
distinctive Asian clade and two European clades 
(Giuffra et aI., 2000; Kijas and Andersson, 
2001). The Asian clade consisted of Japanese 
wild boar, Chinese Meishan domestic pigs and 
some European domestic pigs. Of the two 
European clades, the first consisted of the major¬ 
ity of European and all Israeli wild boar, as well 
as most European domestic pigs (including a 
sample from the Cook Islands in the Pacific), 
and the second was made up of Italian wild boar. 
A molecular clock approach in these studies and 
a subsequent study (Fang and Andersson, 2006) 
demonstrated that the two clades separated 
long before the advent of domestication, and 
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Table 2.1. A list of archaeological terms used in the text, their definitions and the general time frames 
that are associated with each. 


Holocene 

Mesolithic (or Epipalaeolithic) 

Neolithic 

Pre-pottery Neolithic A and B 
(PPNA and PPNB) 

Pottery Neolithic 

Linear Bandkeramic (LBK) 


Jomon 


Chalcolithic 


A geological epoch that began approximately 12,000 bp and 
continues today. A period intimately linked with the origins 
of agriculture and the rise of modern human culture 
A period beginning around 11,660 bp in the Near East and 
ending with the introduction of farming, the date of which 
varied in each geographical region 
Followed the Mesolithic and is not a specific chronological 
period, but a suite of behavioural and cultural characteris¬ 
tics specifically linked with the origins and spread of farming 
The early Neolithic of the Near East, characterized by the 
lack of pottery (dating from 9600 to c.8000bp) 

The period of the Neolithic of the Near East, characterized 
by the first use of pottery (beginning around 8000 bp) 

A major cultural entity of the European Neolithic, dating from 
-7400 to 6900 bp. The geographical focus of the LBK was 
the middle Danube, the upper and middle Elbe and the 
upper and middle Rhine - considered the earliest 
Neolithic culture in Europe 

The longest and earliest recorded human cultural complex in 
Japan, lasting from -12,000 to 600 bp and characterized 
by semi-sedentary hunter-gatherer communities 
Also known as the Copper Age or Eneolithic - this is a 
transitional period between the Neolithic and Bronze Age, 
linked with the beginnings of metallurgy. In the Middle East 
it begins around 6500 bp and lasts for approximately 1000 
years 


thus the genetics confirmed that at least two 
populations of wild boar (in western and eastern 
Eurasia ) were domesticated independently, and 
that female Asian domestic pigs were later 
hybridized with European domestics (Giuffra 
et a/., 2000). 

A subsequent genetic study (Larson et a/., 
2005) of several hundred Sus mitochondrial 
DNA (mtDNA) sequences sampled across 
Europe and Asia revealed a surprisingly strong 
correlation between each sample’s geographical 
origin and the position of that sample on the 
resulting phylogenetic tree. This clear phylogeo- 
graphical pattern (unique for any domestic ani¬ 
mal) revealed a clear East-West cline, indicating 
an evolutionary origin for the genus Sus in 
Island South-east Asia (ISEA) and a subsequent 
westward dispersal of wild boar, culminating in 
the western corners of the European continent. 
This natural migration established a pattern of 
wild boar genetic variation across the Old World 
that today remains largely intact, and which 
proved significant when attempting to explore 
where (and in how many places) pig domestication 


occurred. The idea was that, if domestic signa¬ 
tures clustered with wild haplogroups were 
found in a specific region, then wild boar from 
that region must have either been domesticated, 
or at least contributed maternal DNA to domes¬ 
tic stocks potentially domesticated elsewhere. 
The results clearly demonstrated that not only 
did domestic pigs from Europe and Asia cluster 
with European and Chinese wild boar as 
expected, but also that some domestic pigs clus¬ 
tered with wild boar indigenous to Italy, India 
and peninsular South-east Asia. A separate 
population of feral pigs on New Guinea, 
Halmahera (Moluccan Islands) and Hawaii pos¬ 
sessed yet another unique signature. 

Though the phylogeographical structure 
evident in the phylogenetic tree was supported 
by the vast majority of samples, some of the 
pigs possessed haplotypes that did not match 
those of the region from which they were sam¬ 
pled. This was not entirely surprising given both 
the fact that pigs are behaviourally plastic and 
capable of migrating long distances on their 
own and the fact that humans have also 
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Fig. 2.1. A map depicting nine possible pig domestication centres. (1) Eastern Anatolia. Archaeological 
evidence from several sites in this region dated to 10,000 bp demonstrates a clear intensification of the 
relationship between humans and pigs leading to complete domestication. The first domestic pigs in 
Europe were from stocks originally domesticated in this region. (2) Modern China. Though the number of 
places where pigs underwent a similar transition to that in the Near East is disputed, pig domestication 
certainly began independently along the Yellow River in about 10,000-9000 bp. (3) Europe. Virtually all 
modern European pigs are derived from European wild boar, and, although this process appears to have 
begun at about 7000 bp, it is unknown whether it was independent or was kick-started by the introduction 
of Near Eastern domestic pigs into Europe. (4) South Asia. Though the archaeological evidence for 
domestication is weak, several modern domestic pigs from India and Bhutan share identical 
mitochondrial haplotypes with a native South Asian wild boar, suggesting either an independent 
domestication, or a significant mitochondrial introgression of Indian wild boar into introduced domestic 
stocks. (5) Peninsular South-east Asia. The South Asian circumstance of limited archaeological 
evidence but the presence of native wild boar DNA in domestic stocks is repeated here by the recent 
finding of a separate clade of pigs that possess local wild boar mitochondria. (6) Southern Yunnan 
Province, Northern Vietnam, Northern Laos. A divergent wild boar population in this region possesses 
a unique haplotype only found in the ‘Pacific Clade’ pigs spread across Island South-east Asia and the 
Pacific by early farmers. Curiously, no modern domestic pigs from this region possess that signature, 
most likely as a result of a more recent replacement by domestic pigs indigenous to China. (7) Italy. The 
Italian peninsula plays host to a genetically differentiated indigenous wild boar. Mitochondrial sequences 
identical to those of these wild boar have also been found in domestic and feral pigs in Sardinia and 
Corsica, suggesting introgression (intentional or otherwise) between imported domestic pigs and the 
Italian wild boar. (8) Domestic pigs originally from the island of Lanyu off the coast of Taiwan possess 
highly divergent mitochondrial and nuclear signatures, though thus far no wild boar from the island or any 
surrounding islands with matching signatures have been found. This suggests that these pigs were 
domesticated from a now extinct wild boar that was indigenous to the small islands in the East China 
Sea. (9) Sulawesi. The Sulawesi warty pig (Sus celebensis) is found only on the island of Sulawesi and a 
few outlying islands, where it was probably transported by humans. Some have suggested that these 
animals hybridized with incoming Pacific Clade pigs, and it remains possible, though there is no 
archaeological evidence, so far, that the warty pig was independently domesticated. 
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transported domestic pigs and occasionally even 
wild boar (Vigne et a/., 2009). However, this 
observation meant that those pigs whose haplo- 
types did not match their region could be used 
as a proxy for human movement, and, in this 
context, two results from the tree were interest¬ 
ing (Larson et al., 2005). 

First, as mentioned above, the distinctive 
pigs found on New Guinea, Halmahera and 
Hawaii, where no indigenous populations of 
wild boar were ever present before being trans¬ 
ported by people, were distinct from the stand¬ 
ard East Asian pig signatures found across 
China. This strongly suggested that these pigs 
were derived from a separate population of 
wild boar than those that had given rise to the 
ubiquitous East Asian breeds, and that they had 
then been taken to the islands as part of a dis¬ 
persal by pig-herding farmers. Secondly, none 
of the domestic pigs in Europe possessed 
mtDNA signatures that matched those of wild 
boar from the Near East. The archaeological 
evidence for pig domestication in the Near East 
is well established (see below) though, if the 
first of the two hypotheses above were entirely 
true, then all European domestic pigs should 
possess Near Eastern signatures. The absence 
of any Near Eastern wild boar haplotypes in 
modern domesticated porkers from Europe 
suggested that, although pigs were definitely 
domesticated in the Near East, they must also 
have been domesticated in Europe, however 
independently, thus supporting the second of 
the two hypotheses (Larson et a!., 2005). 

As interesting as these conclusions were, 
because they were based entirely on samples 
that were no more than 200 years old, they 
meant that no empirical time frame for the 
domestication and migration of pigs could be 
ascertained. More simply, these data described 
where but not when domestication took place. 
Additional archaeological evidence (including 
the use of ancient DNA) was necessary to add 
a temporal dimension to these questions in the 
Near East, Europe, China, ISEA and Oceania. 

The Near East 

The zooarchaeological record of the Near East 
all but unequivocally demonstrates that the 
earliest pig domestication took place here. This 


evidence derives essentially from demographic 
changes that can be observed in Sus remains 
excavated from several key archaeological 
sites, principally in Eastern Anatolia (Turkey): 
Qayonii Tepesi, Hallan Qe mi Tepesi, Hayaz 
Tepe, Tell Hallula and Gtircutepe. 

A unique stratigraphic sequence at the site 
of (Tayonu Tepesi (which covers the entire Pre¬ 
pottery and Pottery Neolithic periods) reveals 
that pigs were killed at progressively younger 
ages through the entire sequence, which spans 
10,200-7500 bp uncalibrated (Hongo and 
Meadow, 1998; Ervynck et a/., 2002). In addi¬ 
tion, visible changes to the skeleton and teeth 
(e.g. snout shortening and changes in confor¬ 
mation) and evidence for an increase in physio¬ 
logical stress (revealed by hypoplastic defects in 
the dental enamel) (Ervynck et a/., 2002; 
Dobney et a/., 2004) reveal the actual process 
of a closer human-animal relationship. The 
process appears to have been quite slow, occur¬ 
ring gradually over -2000 years and apparently 
complete by the start of the Pottery Neolithic 
(-8000-7500 bp uncalibrated). Perhaps what is 
most interesting about the evidence from 
Qayonii is the fact that these changes appear 
not to have all been coeval; for example, there 
appears to have been a gradual shortening of 
the molar teeth through time, in contrast to a 
sudden shift to narrower molar teeth later in the 
sequence during the Pottery Neolithic period. 

These intimately linked zooarchaeological 
data have been used to infer a more complex 
physiological and behavioural response in pigs, 
perhaps not originally driven by direct human 
intervention. In fact, Ervynck et al. (2002) sug¬ 
gested that possible early morphological and bio¬ 
metrical changes to the skulls of pigs at (Tayonu 
were possibly related to changes in the rooting 
behaviour of wild individuals, perhaps attracted 
to human settlements by new food sources such 
as crops or human refuse. A long process of 
shifting pig-human relationships ensued, ulti¬ 
mately leading to direct control and full domesti¬ 
cation of some wild boar by the Pottery Neolithic 
period (Albarella et a/., 2006a). 

Though excavations are still being carried 
out, Rosenberg et al. (1995, 1998) have inves¬ 
tigated a huge collection of Sus remains recov¬ 
ered from the early Neolithic Turkish site of 
Hallan (Terni Tepesi (like £ayonu, situated in 
the Eastern Taurus mountains), and suggested 
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that a shift from a completely wild behaviour 
within a wild boar population to a way of living 
closer to humans may have taken place even 
earlier than at (Tayonu. In a review of Sus data 
from sites more recent than Qayonii or con¬ 
temporary with its later Pre-pottery phases, 
Peters et al. (1999) observed a decrease in the 
length of the third molar between Pre-pottery 
Neolithic B (PPNB) material from Qayonii and 
Late PPNB specimens from Gurcutepe (see 
Table 2.1). The authors claim this to be ‘un¬ 
equivocal morphometrical evidence for the 
occurrence of domestic pigs’ in the Late PPNB 
(Albarella et a/., 2006a). Peters et al. (1999) 
also highlight data from other Late PPNB sites, 
such as Hayaz Tepe and Tell Hallula, to sub¬ 
stantiate their claim for the appearance of 
domestic pigs in that chronological period. 

A genetics perspective on this region is 
difficult for two primary reasons, the first of 
which is that the dominant Muslim culture of 
the region has meant that very few, if any, 
modern pigs are kept, eaten or available to 
sample. Secondly, even if a modern popula¬ 
tion of Near Eastern domestic pigs existed, 
regions of the pig genome that correlate with 
the morphological changes described above 
have not yet been identified. The genetic win¬ 
dow that has been opened has thus far focused 
primarily on the mitochondrial genome, though 
this is changing as high-throughput sequencing 
becomes a reality. The neutrally evolving 
mitochondrion is ideal for understanding 
population-level differences as it reveals a level 
of resolution invisible at the phenotypic level of 
an individual pig, and, though the insights into 
where and how many times pigs were 
domesticated that this genome has provided 
have been invaluable, understanding how the 
process resulted in domestic pigs will come 
once genes involved in the process have been 
identified and sequenced (see below). 


Europe 

The establishment of economic and cultural ele¬ 
ments associated with Neolithic farming, includ¬ 
ing domestic pigs, took place across Europe 
between the 9th and 6th millenniums bp. How 
this process happened has been the focus of a 


great deal of research, the results of which have 
tended to focus on three primary hypotheses. 
According to these hypotheses, the European 
Neolithic may have resulted from either the 
direct migration of immigrant farmers from the 
Near East (known as demic diffusion; Childe, 
1957; Finhasi et al., 2005; Sampietro et al., 
2007), the transmission of the idea of domesti¬ 
cation and agriculture through established trade 
and exchange networks (cultural diffusion), or 
the independent development of agriculture 
(including the domestication of pigs and cattle) 
by indigenous European Mesolithic cultures 
(Renfrew, 1972; Clark and Price, 1981). It is 
worth pointing out that these explanations are 
not mutually exclusive (Zvelebil, 2000), as it 
may have been possible for both Near Eastern 
populations and ideas to have entered Europe, 
and then be taken up and expanded upon by 
local populations, though the degree to which 
either Near Eastern or ancient European popu¬ 
lations have contributed to the make-up of 
modern European human genomes also 
remains contentious (Haak et al., 2005; Belle 
et al., 2006). 

As mentioned earlier, Larson et al. (2005) 
not only revealed a complete absence of Near 
Eastern wild boar mtDNA signatures in mod¬ 
ern domestic pigs sampled from Europe, but 
also showed that European domestic pig 
mtDNA haplotypes were identical to those 
sampled from modern/recent European wild 
boar populations. European wild boar popula¬ 
tions must have begun contributing DNA to 
domestic pigs in Europe (even if those pigs 
were brought in from the Near East), though 
when this began remained an open question. 
In addition, the absence of Near Eastern signa¬ 
tures in modern domestic pigs strongly sug¬ 
gested that Near Eastern genetic lineages were 
either never introduced by farmers into Europe 
or, if they were, those mtDNA lineages have 
been all but completely replaced by those from 
pigs derived from European wild boar. 

Larson et al. (2007b) attempted to answer 
these questions by sequencing mtDNA 
sequences extracted from the archaeological 
remains (bone and teeth) of S. scrofa from 
Europe and the Near East. The genetic 
sequences obtained in this study from ancient 
(and additional modern) wild boar were placed 
into a temporal framework that allowed for the 
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timing of movements of different mtDNA lin¬ 
eages of pigs to be determined. The relative 
geographical position and haplogroup affinity 
of 20 pre-Neolithic (13,000-7500 bp) wild 
boar samples from Europe were identified. The 
observation that all of these samples possessed 
European haplotypes (Larson et a/., 2007b) 
supported previous observations of a phylogeo- 
graphical boundary between Near Eastern 
and European wild boar haplotypes (Larson 
et a/., 2005), and suggested that this biogeo- 
graphical boundary was in place during the 
early Holocene. Additional evidence came 
from four wild individuals dated from Neolithic 
and post-Neolithic (Chalcolithic) sites in 
Romania close to the modern phylogeographi- 
cal boundary that also possessed European 
haplotypes. Although the exact location of this 
boundary is difficult to establish, the appear¬ 
ance of a Near Eastern pig haplotype in two 
supposed ‘wild’ S. scrofa specimens from 
Mesolithic and Neolithic sites in the Crimea 
indicated that the distribution of Near Eastern 
haplotypes probably extended to the north 
shore of the Black Sea. 

Unique Italian genetic sequences, originally 
identified by Giuffra et al. (2000), Kijas and 
Andersson (2001) and Lattuada et al. (2009), 
were found both in samples of Mesolithic Italian 
wild boar and in Mesolithic Croatian and 
Medieval Sardinian samples, thus extending the 
historical geographical range of this haplogroup 
and suggesting that Italian wild boar had been 
transported to Sardinia, though it was impossi¬ 
ble to say whether the original pigs were wild or 
domestic (Larson et al., 2007a). 

Later (Neolithic) S. scrofa mtDNA 
sequences from Europe clearly demonstrate 
that pigs with Near Eastern haplotypes did 
cross the phylogeographical boundary, the 
clear result of a human-mediated dispersal. 
Eleven pig specimens (identified by standard 
zooarchaeological criteria as domestic), from 
four Neolithic Romanian sites dating to 
7500 bp, possessed a Near Eastern haplotype 
identical to that found in a single recent mod¬ 
ern boar from Turkey and two from Iran. Five 
so-called wild specimens (based on their large 
size) from the same site possessed European 
haplotypes, suggesting that, though local farm¬ 
ers were herding pigs whose ancestors were in 
the Near East, they were hunting and eating 


local wild boar. The same Near Eastern 
haplotype was also identified in four specimens 
from the 8th millennium bp Linear Bandkeramik 
(LBK) site of Eilsleben in northern Germany, 
and in two samples from the mid Neolithic 
(very early 6th millennium bp Chasseen culture) 
site of Bercy in the Paris Basin. At Bercy, 
mtDNA sequences of European origin were 
also extracted from archaeological specimens 
identified by zooarchaeological criteria as 
domestic swine, making it the earliest site 
where both Near Eastern and European 
domestic swine have been identified together. 

The results described above clearly dem¬ 
onstrate that Near Eastern-derived domestic 
pigs were physically moved westward with 
early farmers into Central and Western Europe 
during the Neolithic. However, after the mid- 
6th millennium bp, Near Eastern Sus lineages 
vanished, perhaps in as little as 500 years. 
Thus, by at least the 6th millennium bp, 
European wild boar had already been domesti¬ 
cated and the domestic form spread through¬ 
out Europe, replacing the original introduced 
pigs of Near Eastern origin. Whether this 
replacement was the result of continual hybrid¬ 
ization by the only locally available source of 
wild boar, thus minimizing and then eliminat¬ 
ing the genomic input of Near Eastern wild 
boar, or a more active selection against those 
pigs who displayed Near Eastern affinities is 
not yet known, though again, with the appli¬ 
cation of increasingly robust sequencing pow¬ 
ers, this question will hopefully be answered in 
the next decade. 

The zooarchaeological record provides 
additional important evidence for this complex 
process. Although pig remains are compara¬ 
tively scarce relative to those of other domestic 
animals from Neolithic Romanian sites 
(Bala§escu et al ., 2005) and from most LBK 
sites in central and western Europe (Tresset 
and Vigne, 2004), they increase in abundance 
(Tresset, 2003) and size (Tresset and Vigne, 
2007) starting in the first part of the 7th mil¬ 
lennium bp, well after the initial introduction of 
Near Eastern domestic pigs. There is also a 
clear increase in the frequency of physiological 
stress (as evinced by incremental defects in the 
dental enamel, known as enamel hypoplasia) 
between S. scrofa remains of Mesolithic and 
Neolithic date (Dobney et al., 2004). 
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Interestingly, the replacement of Near 
Eastern domestic pigs by European popula¬ 
tions was not restricted to Europe. A series of 
archaeological mtDNA sequences extracted 
from the 7th to the 4th millennia bp through to 
the early Iron Age Sus remains from Armenian 
archaeological sites revealed a clear transition 
through time from Near Eastern to European 
haplotypes. This turnover, analogous to that 
seen in European sites, suggested that, by the 
9th century bp, European domestic pigs had 
spread eastward to Armenia replacing the 
indigenous domestic pigs of Near Eastern 
ancestry (Larson eta/., 2007a). This transition 
probably reflects the major reorganization that 
occurred in the various Neo-Hittite polities dur¬ 
ing the 3rd millennium bp, involving the large- 
scale movement of peoples and the expansion 
of trade and exchange networks during the 
later Iron Age (Smith, 2003). 

The presence of the unique Italian haplo¬ 
types (mentioned earlier) in modern ‘wild’ pigs 
of Sardinia, which, like Corsica, was an island 
devoid of pigs before the arrival of Neolithic 
settlers (Vigne, 1992), supports the possibility 
that indigenous Italian wild boar may also have 
been separately domesticated (Albarella et a/., 
2006b). The Italian haplotype was also identi¬ 
fied in numerous ancient samples, including 
Mesolithic wild boar from Pupicina cave in 
Croatia, early and mid Neolithic pigs from 
Grotta della Madonna cave in south-western 
Italy, a 4th millennium bp (middle Bronze Age) 
Sardinian site, and numerous medieval wild 
boar from Tuscany and Rome. These samples 
indicate that not only were indigenous Italian 
wild boar distributed beyond their current 
restricted region of Maremma in north-west 
Italy, but the presence of the Italian haplotype 
in Bronze Age central Sardinia also suggests 
either an independent domestication of native 
Italian wild boar or the incorporation of female 
Italian wild boar into domestic stocks that were 
subsequently imported to Sardinia by at least 
the end of the 4th millennium bp. 

The zooarchaeological evidence supports 
these conclusions. In a summary of prehistoric 
pig exploitation in the Italian peninsula and 
Sicily, Albarella et al. (2006b) found a broadly 
consistent diachronic pattern of change in Sus 
body shapes across Italy. During the Mesolithic, 
Italian wild boar were relatively small (though 


their bones were quite large in relation to the 
teeth). This pattern persisted in the Neolithic, 
although shifts in pig exploitation at some sites 
were interpreted as the beginning of local 
domestication. A clear morphological separa¬ 
tion between wild and domestic pigs is not evi¬ 
dent until the Bronze Age, when domestic pigs 
decreased in size and wild boar appear to have 
increased (Albarella et a/., 2006b). 

The genetic and zooarchaeological records 
therefore reveal a complex temporal and geo¬ 
graphical pattern of changes in Holocene 
Europe. These data provide unique insights and 
answers to specific questions about the nature 
of the European Neolithic revolution. First, the 
presence of Near Eastern haplotypes in 
Neolithic contexts in Romania, Germany, 
France and Croatia demonstrates that Near 
Eastern pigs were introduced by early farmers 
into Europe. Secondly, given the time frame of 
the initial introduction of Near Eastern domes¬ 
tics and the first appearance of domestic pigs 
derived from European wild boar, it appears 
that European pig domestication may not have 
been a truly independent event but rather a 
direct consequence of the initial introduction of 
Near Eastern domestic pigs (and other animals) 
into Europe by early farmers. If true, the proc¬ 
ess of pig domestication in Europe (and possi¬ 
bly the degree of intentionality among early 
farmers) appears fundamentally different from 
that in the Near East. Regardless of the specific 
cause or progression of European pig domesti¬ 
cation, what is clear is that, once European wild 
boar were domesticated, they rapidly became 
the predominant lineage within European, and 
later South-west Asian, domestic swine. 

China 

China has been accepted as a primary and 
independent centre of early pig domestication, 
and recent studies in East Asia have highlighted 
the antiquity of plant and animal domestication 
and the generally greater amount of genetic 
variation found in East Asian wild boar and 
pigs relative to their European counterparts 
(Fang et a/., 2005, 2009; Megens et a/., 
2008). Early agricultural activities practised by 
seasonally mobile cultivators were well estab¬ 
lished along the Yellow River and Inner 
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Mongolia by -8000 bp (Barton et a/., 2009; 
Fuller et al., 2009; Liu et a/., 2009), and plant 
cultivation may have even begun 2000 years 
earlier on the hilly flanks of the Yellow River 
Valley (Lu et al., 2009). In southern China, it 
was sedentary hunter-gatherers (Fuller and 
Qin, 2009) who first began cultivating rice 
along the Yangtze river at about 9000-8000 bp, 
a process that culminated in the dependence 
upon domesticated rice agriculture by -6000 bp 
(Fuller et al., 2009). Although dogs may have 
been the earliest domesticated animals in these 
regions, archaeological evidence suggests that 
domestic pigs were soon prevalent in both 
northern and southern China by at least 
8000 bp (Yuan and Flad, 2002; Flad et al., 
2007). In both regions, however, pigs make up 
only a small percentage of the earliest mammal 
bone assemblages, which are primarily domi¬ 
nated by remains of hunted deer (Flad et al., 
2007; Jing et al., 2008). 

Long-standing claims for very early 
(-12,500 years bp) domestic pigs at, for exam¬ 
ple, the site of Zengpiyan near Guilin, have 
more recently been refuted by reanalyses of 
the zooarchaeological record by a new genera¬ 
tion of Chinese zooarchaeologists (e.g. Yuan 
and Flad, 2002). Recently, systematic and tra¬ 
ditional analyses of previously excavated 
Chinese zooarchaeological assemblages and 
subsequent data syntheses - along with the dis¬ 
covery of new sites - have led to the proposal 
of a new model for pig domestication in China: 
one that involves multiple and independent 
centres focused upon the Yellow River and on 
the Lower Yangtze river around 8000 years 
ago (Jiang, 2004). The sheer geographical 
scale of China, which has revealed a substan¬ 
tial diversity of Neolithic cultural entities scat¬ 
tered within a mosaic environment where at 
least two subspecies of wild boar exist (Groves, 
1981, 2007; Groves and Grubb, 1993), makes 
such a model highly plausible. However, under¬ 
standing the natural and cultural dynamics 
behind the early pig domestication process in 
China requires further integrated studies in 
which novel morphometric and genetic 
approaches have proved invaluable. 

As mentioned previously, a very strong 
phylogeographical structure among wild boar 
was revealed using a phylogenetic tree, with a 
clear cline from East to West across the Old 


World (Larson et al., 2005). However, while 
the western Eurasian and ISEA branches of 
that tree consisted of well-supported clades, the 
portion of the tree that comprised samples 
from the East Asian mainland was defined by a 
large polytomy of clades and individual 
branches. As a result (and because of the pau¬ 
city of Chinese samples in the study) it was 
impossible to say much more beyond the fact 
that pig domestication had taken place, at some 
point in the past, somewhere in East Asia. A 
series of larger genetic studies was therefore 
undertaken in an effort to refine the geographi¬ 
cal and temporal context of East Asian pig 
domestication by adding more modern and 
ancient East Asian wild boar and domestic pigs 
sequences (Larson et al., 2007b, 2010). In 
addition to the genetic research, several com¬ 
plementary morphological proof-of-concept 
studies - involving tooth shape analyses of both 
recent and archaeological Sus specimens - 
were first piloted in an attempt to explore the 
potential complementarity of such an approach 

(Larson et al., 2007b; Cucchi et al., 2009, 
2011). Though a true understanding of pig 
domestication is only possible using a combina¬ 
tion of both approaches, the results of genetic 
and morphological approaches will be explored 
separately below. 

A recent study (Larson et al., 2010) com¬ 
bined over 1000 modern wild and domestic pig 
samples with the results from an ancient DNA 
analysis that generated sequences from nearly 
20 specimens spanning the last 9000 years. 
The data revealed 34 haplotypes found only in 
wild specimens, 99 haplotypes found only in 
domestic specimens, with 20 haplotypes shared 
between 86 wild and 581 domestic pigs. In 
addition, 25 separate clades that consisted of at 
least two haplotypes were present on the phy¬ 
logenetic tree, although most of the haplotypes 
were arranged as a large polytomy referred to 
as the general cluster. 

Contrasting the phylogenetic position of 
these S. scrofa sequences with their geographi¬ 
cal provenance revealed several interesting pat¬ 
terns. The domestic specimens found within the 
general cluster were present throughout East 
Asia, and many of them represent the most com¬ 
mon Asian pig haplotypes found today in globally 
distributed modern breeds (Larson et al., 2007b). 
In contrast, the general cluster haplotypes found 
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only in wild boar, and those that were shared 
between wild boar and domestic pigs, were more 
restricted geographically; they were found mostly 
in Central China, or in the neighbouring Yunnan 
Province and Bhutan. 

The geographical patterning of the speci¬ 
mens found in the 14 clades outside the gen¬ 
eral cluster made up only of wild boar and in 
the three clades made up of wild and domestic 
pigs showed a different pattern. Of the wild 
clades, four were restricted to islands including 
Japan, Okinawa, Taiwan and the southern 
Chinese island of Hainan. Three clades were 
found only in South Korea. One clade was 
restricted to North-east Asia (not including 
South Korea), one to Central China (though 
two samples are also found in northern 
Vietnam), and five clades were found only in 
samples from Indochina. 

Of three clades that possessed haplotypes 
found in both wild and domestic samples, the 
first included both native wild boar and domes¬ 
tic pigs from India and Nepal (Larson et ai, 
2005; Tanaka et a/., 2008). The second clade, 
referred to as the Pacific Clade by Larson et ai. 
(2005), included six samples found in northern 
Vietnam, Yunnan Province and Laos. Despite 
additional sampling of domestic pigs from 
Central China and Indochina, the only domes¬ 
tic or feral pigs that belonged to this clade were 
found in Oceania. The third clade, referred to 
as MTSEA, given their mountainous and 
South-east Asian distribution (Tanaka et ai, 
2008), consisted of both wild and domestic 
samples found almost exclusively in Indochina, 
although one domestic sample from Bhutan 
and one sample described as wild from Taiwan 
were also found within this clade. 

Network analysis revealed a separation of 
the clades that consisted solely of wild 
specimens from those of domestic or mixed 
origin, and the addition of 18 archaeological 
specimens to the network did not alter this 
pattern. Also, though three of the nine ancient 
haplotypes were novel, the six remaining 
haplotypes were identical to the most com¬ 
mon haplotypes found within all modern East 
Asian pigs. In fact, the ancient samples derived 
from archaeological sites spanning 5000 years 
possess five of the seven most common haplo¬ 
types shared by modern wild and domestic 
East Asian pigs. 


These genetic results (Larson et a/., 
2007b, 2010) clearly demonstrate that the 
most common modern domestic haplotypes 
found within central China are also the most 
common Asian haplotypes found across East 
Asia, in Australian feral pigs, and in modern 
European and American breeds, the latter as a 
consequence of the 18th-century drive to 
improve European breeds by hybridizing them 
with imported Asian pigs (Jones, 1998; Giuffra 
et a/., 2000; Yang et a/., 2009). The lack of 
fine geographical resolution of the mtDNA 
data across central China (probably resulting 
not just from a lack of genetic differentiation 
between wild boar populations in this area, but 
also from a history of human-mediated move¬ 
ment that has blurred whatever signal may 
have originally existed) precludes any conclu¬ 
sions regarding multiple centres of domestica¬ 
tion here, although the identification and typing 
of distinguishing markers in the nuclear genome 
in ancient samples may generate enough reso¬ 
lution to address this question. 

The position of ancient DNA sequences 
among the most common haplotypes on the 
network clearly supports recent zooarchaeo- 
logical conclusions that purport modern 
Chinese pigs to be the direct descendants of 
the original populations of domestic pigs sam¬ 
pled along the Yellow River (Jing et ai, 2008). 
Though this evidence does not rule out sepa¬ 
rate pig domestications from geographically 
and genetically differentiated wild boar in other 
parts of Central China, it does demonstrate a 
long-term genetic continuity between early and 
modern domestic pigs. In addition, neither 
modern nor ancient pig samples in this study 
share any genetic affinity with modern wild 
boar from Central China. The combination of 
a shared geographical distribution but mainte¬ 
nance of a strict genetic differentiation suggests 
not only that domestic haplotypes have not 
leaked into wild populations, but also that nei¬ 
ther ancient nor modern herders of domesti¬ 
cated pigs made a lasting effort to incorporate 
the females of other separate wild populations 
into their domesticated stocks. 

The evidence from the countries surround¬ 
ing China, however, seems thus far to suggest 
that, regardless of how many times wild boar 
were domesticated in what is now China, there 
were no analogous processes of domestication 
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on the Korean peninsula, in Japan, on the 
Ryukyu archipelago or on the island of Taiwan. 
Although each of these regions harbours 
genetically differentiated populations of wild 
boar that probably migrated to these places 
during periods of lowered sea level, the domes¬ 
tic pigs that currently occupy these places 
derive from the Chinese domestication process 
described above (Larson et al., 2010). 

There is, however, one notable exception 
to this pattern. Microsatellite and mtDNA from 
pigs originally from the tiny island of Lanyu, off 
the south-east coast of Taiwan, have revealed 
that this population is significantly different 
from all other wild and domestic pigs, though 
they are more similar to east Asian populations 
than to those from ISEA (Wu et a!., 2007; 
Chang et al., 2009). More intriguingly, the 
island of Lanyu does not play host to a popula¬ 
tion of equally unique wild boar, suggesting 
either that the wild ancestor of these pigs was 
endemic to Lanyu and has since been extermi¬ 
nated, or that Lanyu domestic pigs were derived 
from an as yet undiscovered population of wild 
boar. Either way, this population represents the 
sole indigenous domestic variety found among 
islands and land masses east of China, and is 
deserving of a great deal more study. 

Although ancient and modern DNA sam¬ 
ples are proving to be one of the most useful 
tools for investigating domestication and 
human-mediated dispersal of domesticated ani¬ 
mals, the mtDNA and nuclear markers used in 
palaeogenetics are often selectively neutral 
and, therefore, incapable of detecting responses 
related to rapid phenotypic changes in the 
skeleton associated with the domestication 
process (Bradley, 2006). In addition, the pres¬ 
ervation of DNA is also a crucial limitation in 
palaeogenetics. The Near Eastern cradle of 
domestication, for example, is characterized by 
high and fluctuating temperatures that have 
reduced the availability of ancient sequences 
(Bollongino and Vigne, 2008). Molar shape 
analysis is a complementary tool that can be 
used to assess S. scrofa variation using geo¬ 
metric morphometries to track phenotypic 
responses associated with both the process of 
domestication and its geographical origins. 

A recent study (Cucchi et al., 2011) inves¬ 
tigated Sus remains from three early Neolithic 
sites in mainland Northern, Central and 


Southern China (Xinglongwa in Hebei 
Province, Jiahu in Henan Province and 
Zengpiyan in Guangxi Province), two of which 
(Xinglongwa and Zengpiyan) have been previ¬ 
ously used to support claims for early pig 
domestication (Nelson, 1998; Yuan and Flad, 
2002). Pig remains from the late Neolithic site 
of Xishuipo (Henan Province) were also 
included in the analysis as comparative data as 
their domestic status is unequivocal. The evi¬ 
dence for pig domestication at Xinglongwa (in 
the far north-east of China) is based on an 
association between two complete S. scrofa 
skeletons and a wealthy human burial dated 
from the latest phase of occupation (8000- 
7400 bp) (Jing, 2006), and large collections of 
pig skulls found in subterranean buildings, all of 
which possess unambiguous, circular holes in 
the frontal bone associated with poleaxing. 

As mentioned above, excavations in 
Southern China during the 1970s at the lime¬ 
stone cave of Zengpiyan in Guangxi Province 
resulted in claims for the earliest evidence of 
pig domestication in China (Chang, 1986). 
The site has recently been re-excavated 
(Institute of Archaeology, 2003) and a reinter¬ 
pretation of the sequence has resulted in final¬ 
izing the dates of occupation as between 
12,000 and 7000 bp. Reinterpretation of the 
vertebrate assemblage from Zengpiyan has 
revealed a subsistence economy most likely 
focused exclusively upon hunting and gather¬ 
ing throughout the occupation sequence, 
although a change in ceramic and lithic tech¬ 
nologies observed in the latest phase indicates 
a potential shift towards cultivation (Institute of 
Archaeology, 2003). The Sus remains, how¬ 
ever, are from small individuals, and thus their 
wild, domestic or semi-domestic status remains 
uncertain. 

Two additional sites in East Central China 
(Jiahu and Xishuipo, Henan Province) are from 
a key region of the central Yellow River valley, 
both of which have produced important zoo- 
archaeological collections. Jiahu has been 
conventionally radiocarbon dated to 9000- 
7700cal (calibrated years) bp (Zhang et al., 
1999), confirmed by thermoluminescence (TL) 
dating (Yang et al., 2005). It is perhaps one of 
the most significant Early Neolithic sites in 
China, and is of particular importance in our 
understanding of the development of Neolithic 



Genetic Aspects of Pig Domestication 


27 


civilization in the Yellow River basin (Zhang 
and Wang, 1998; Zhang et a/., 2004), with 
evidence of early writing (Li et a/., 2003) and 
early development of music (Zhang et a/., 
1999), and where evidence of the earliest rice 
domestication in Northern China remains hotly 
debated (Fuller et a/., 2007). The late Neolithic 
site of Xishuipo is located on the Northern 
shore of the Yellow River and is dated to 

5000-3000bp (Kesner, 1991). 

Shape analyses of more than 100 Sus 
lower second molars from both modern com¬ 
parative and archaeological specimens were 
undertaken and, although the data set was rela¬ 
tively small, important conclusions could still be 
drawn. First, the data demonstrated that molar 
tooth shape can be used as an effective pheno¬ 
typic marker, and can provide highly significant 
discrimination between wild and domestic 
forms (Cucchi et a/., 2011). Secondly, the 
analysis revealed that the Zengpiyan pigs dis¬ 
played a typically wild second molar morpho- 
type, contradicting the earlier claims of the 
presence of pig domestication at Zengpiyan 
(Nelson, 1998) and supporting more recent 
claims that the pigs were in fact wild boar (Yuan 
and Flad, 2002). The Zengpiyan wild morpho- 
type was not identical in shape or in size to that 
represented by the modern wild boar compara¬ 
tive data used in this study, however, suggest¬ 
ing that different wild Sus morphotypes existed, 
and may still exist, across China, a result sup¬ 
ported by the genetic studies discussed above. 
Indeed the, North-eastern Chinese wild boar 
comparative specimens used in this study origi¬ 
nated from the current geographical range of 
S. s. ussuricus, recognized as the largest sub¬ 
species of S. scrofa , whereas Zengpiyan is situ¬ 
ated in (wetter and warmer) South China within 
the current range of the southern subspecies, 
S. s. moupinensis (Groves, 2007). This south¬ 
ern subspecies is today described as ‘fairly 
small but with a broad and high crowned skull’ 
(Groves, 2007), a description that fits with the 
dental traits identified by Cucchi et al. (2011) 
at Zengpiyan. 

Morphometric signatures from the Sus 
remains from Xinglongwa in North-east China 
also revealed the presence of a clearly wild 
morphotype that showed strong phenetic rela¬ 
tionships with the modern S. s. ussuricus 
specimens included in the study (Cucchi et a I ., 


2011). These data suggested that the 
Xinglongwa pigs derive from locally hunted 
wild boar and that the wild boar phenotype in 
Northern China has been conserved without 
major morphological change for at least the 
last 8000 years. This, and other evidence at 
Xinlongwa, supports the idea of an early sym¬ 
bolic (as well as economic) value of wild boar, 
and the practices associated with this more 
intimate relationship perhaps led ultimately to 
an independent domestication of pigs in North¬ 
east China. 

The story from the early Neolithic site of 
Jiahu appears to be different. Molar shape 
analyses by Cucchi et al. (2011) demonstrated 
that Jiahu pigs and domestic pigs from the 
later site of Xishuipo have clear domestic sec¬ 
ond molar morphotypes that, though similar, 
are not identical. Both data sets display a com¬ 
mon secondary divergence trajectory indicat¬ 
ing that both populations have undergone the 
same directional phenotypic evolution. On the 
basis of these data, it has been argued that as 
early as 10,600 bp, the pigs at Jiahu were 
clearly in the process of undergoing morpho¬ 
logical change triggered by domestication 
(Cucchi et al., 2011). 

The evidence from the recent genetic and 
morphological analyses confirms the suspected 
independent domestication of wild boar in 
China, though so far neither of the types of 
analysis has been able to definitively state how 
many different times in how many different 
places the process of ever-tightening depend¬ 
ence of wild boar and people took place. As 
more samples are analysed, and as the resolu¬ 
tion generated by each of these techniques 
becomes ever greater, it seems likely that the 
ongoing uncertainty will be significantly 
reduced, perhaps to the point where definitive 
answers are uncovered. Evidence for additional 
centres of East Asian pig domestication outside 
China is emerging as well, and is discussed in 
the next section. 

Peninsular South-east Asia, 

Island South-east Asia and Oceania 

This region is one of the most important and 
interesting for studying the range of different 
relationships between pigs and humans, not 
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only because the Sus genus probably originated 
in ISEA, but also because the genetic diversity 
here is greater than anywhere else (Groves, 
1981, 1983). Despite the near ubiquitous 
presence of Sus remains from Holocene 
archaeological sites in mainland East Asia, 
ISEA and Oceania, however, comparatively lit¬ 
tle is known about the temporal context of pig 
domestication, the species involved, or their 
involvement in the different human diasporas 
of the region (Hardjasasmita, 1987; Lucchini 
et al., 2005). 

Fig domestication in mainland Asia is rela¬ 
tively simple given that only one species 
(S. scrofa) could have been involved. In ISEA, 
however, a variety of Sus species (S. scrofa, 
S. verrucosus, S. barbatus, S. celebensis, 
S. cebifrons and S. philippensis - see Chapter 1) 
are endemic, all of which were hunted by 
humans in the past and presumably had the 
potential to be domesticated. The taxonomy of 
these island suids remains contentious, as does 
their species identification and wild, feral or 
domestic status within the fossil record (Cucchi 
et al., 2009). To make matters worse, inter¬ 
species hybridization between introduced 
S. scrofa and other indigenous species has 
been claimed (Blouch and Groves, 1990). 
Groves (1981) has also stated on the basis of 
morphometries that the feral and domestic pigs 
of New Guinea were hybrids of S. scrofa and 
S. celebensis. 

The previously discussed genetic study 
(Larson et al., 2005) of modern wild and 
domestic Sus supported Groves’ (1981, 1983) 
conclusions stating that the genus Sus origi¬ 
nated in ISEA and also revealed evidence for 
additional pig domestications in South-east 
Asia. First, one haplotype indigenous to wild 
boar from peninsular South-east Asia was also 
typed in Australian feral pigs from northern 
Queensland. Australia was devoid of any Sus 
species until pigs were introduced by post- 
European contact settlers. The current feral 
population is therefore made up of escaped 
domestic pigs that are mostly European, 
though the presence of the South-east Asian 
haplotype complicates the picture by suggest¬ 
ing either that indigenous South-east Asian 
pigs were independently domesticated and 
transported, or were possibly moved into 
Australia as wild animals. Neither scenario is 


supported by any historical evidence, and the 
absence of the South-east Asian haplotype in 
between Australia and Thailand only further 
confounds the issue. Secondly, the data 
revealed another phylogeographical anomaly: 
the presence of a significant number of Sus 
samples on islands east of the major biogeo- 
graphical boundary known as Wallace’s Line 
that possessed mtDNA signatures consistent 
with an East Asian mainland origin. These 
haplotypes, collectively referred to as the 
Pacific Clade (Larson et al., 2005), were 
present in every pig sampled from New Guinea, 
the eastern Indonesian island of Halmahera 
and in several specimens in remote Oceania as 
far east as Hawaii. Because these haplotypes 
clustered with others indigenous to East Asia, it 
followed that a population of wild boar, sepa¬ 
rate from those found in China, had been 
domesticated and moved through ISEA and 
into the Pacific islands (possibly associated with 
the Austronesian expansion), though, because 
no mainland samples possessed these haplo¬ 
types, determining precisely where this domes¬ 
tication had taken place was not possible. 

Current archaeological evidence suggests 
that, once established, intensive, sedentary 
agriculture (including rice, millet and pigs) 
expanded across northern Asia in about 
6500-5000 bp and across South-east Asia 
between 5000 and 4000 bp during the Late 
Neolithic (Chang, 1986; Liu, 2004; Bellwood, 
2006). Though the archaeological evidence 
for the domestication of pigs in central China 
is relatively well established (see above), there 
is currently no indication of domestic pigs in 
peninsular South-east Asia until the end of 
the 5th millennium bp, when they are said to 
appear alongside the first evidence of seden¬ 
tary agriculture (Higham, 1975, 2002b). 
Domesticated pigs and rice are also claimed 
to be present in South-east Asia (Thailand) no 
earlier than -4000 bp (Higham, 1975, 2002b). 

Despite the lack of archaeological corrob¬ 
oration, the significant genetic variation present 
in wild and domestic pigs in Indochina has pre¬ 
viously been used to propose an independent 
centre of pig domestication somewhere in 
peninsular South-east Asia (Jing et al., 2008). 
More recent genetic evidence (Larson et al., 
2007b) clearly demonstrated that some breeds 
of domestic pigs share haplotypes with two 
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clades of differentiated wild boar that are cur¬ 
rently found only in Indochina, thus suggesting 
the domestication of wild boar lineages in this 
region, though the details and temporal con¬ 
text of this process remain unknown. What is 
clear, however, is that, once domesticated, 
Pacific Clade pigs were dispersed by people 
into ISEA, Wallacea and the Pacific. 

The sole sequences that rooted the path¬ 
way of Pacific Clade haplotypes in mainland 
Asia were reported from two pigs from north¬ 
ern Vietnam (Hongo et a/., 2002). Four more 
Pacific Clade sequences have been identified in 
modern wild boar from Laos and Yunnan 
Province, China (Larson et a/., 2010). These 
samples expand the geographical range of wild 
boar possessing Pacific Clade signatures and 
add support to the hypothesis that the Pacific 
Clade is indigenous to peninsular South-east 
Asia. This evidence also supports the postulated 
Neolithic expansions of Austro-Asiatic language 
speakers along the major South-east Asian riv¬ 
ers from Yunnan (Higham, 2002a). The fact 
that no modern domestic pigs possessing Pacific 
Clade haplotypes have yet been found in main¬ 
land Asia is most likely a consequence of a 
replacement of native pigs by pigs introduced 
from China, a situation analogous to the previ¬ 
ously discussed replacement of Near Eastern 
domestic pigs by European domesticates during 
later prehistory (Larson eta/., 2007a). 

There have been several demographic 
expansions of agricultural populations into 
peninsular South-east Asia that could have 
resulted in the importation of domestic pigs 
from (for example) central China. These include 
Austronesian speakers through ISEA and parts 
of the mainland coastal regions (Pawley, 2003), 
post-Neolithic expansions of Sino-Tibetan 
speakers (Van Driem, 1998; Pawley, 2003) 
and Austro-Tai or Miao-Yao groups from 
Southern China (Blench, 2005). Importantly, 
the replacement did not extend beyond main¬ 
land South-east Asia, leaving intact the popula¬ 
tions of introduced domestic Pacific Clade pigs 
in ISEA. 

With the exception of the wild boar 
sequences mentioned above, all other Pacific 
Clade pigs appeared to be scattered throughout 
ISEA and the Pacific (Larson et a/., 2010), 
though four Pacific Clade pigs were identified in 
Sumatra and Java (islands on which indigenous 


wild populations of Sus existed well before the 
Neolithic), while Pacific Clade pigs also made up 
15 of 19 specimens from eight islands east of 
Wallace’s Line in the Lesser Sunda chain and 
the Moluccas, as well as 100% of New Guinean 
samples. This pattern indicates that, like 
Australian ‘wild pigs’, the New Guinea feral pig 
population is also descended from mainland 
Asian domestic pigs, though in this case the 
original population was originally introduced by 
farmers carrying Pacific Clade domestic pigs 
travelling east over Wallace’s Line. 

Perhaps more important in ascertaining 
the migration route that pigs and humans took 
to reach New Guinea is the fact that no Pacific 
Clade pigs have yet been identified from 
Taiwan (despite numerous samples from both 
native wild and domestic modern pigs and an 
ancient domestic sample), from the 40 wild 
samples from the Philippines (identified as 
endemic S. philippensis), or from the 17 intro¬ 
duced domestic samples from two central 
Philippine islands (Panay and Cebu). 

In order to provide at least some temporal 
context for pig dispersal in Oceania, ancient 
DNA was also successfully extracted from five 
archaeological pig specimens from purportedly 
pre-European contact sites in Polynesia (from 
Tubuai, Hanamiai in the Marquesas and the 
Tangatatau rock shelter in Mangaia), the Reef 
Islands and Mussau. Although somewhat lim¬ 
ited in number and geographical and temporal 
coverage, the sequences of these specimens 
demonstrated they were all Pacific Clade pigs, 
thus linking them with the Polynesian dispersal 
and, by association, with that of the earlier 
Lapita cultural complex - traditionally associ¬ 
ated with the first peopling of remote Oceania 
(Kirch, 2000). 

Analyses of a further six ancient mtDNA 
sequences extracted from archaeological Sus 
specimens from Liang Bua Cave (Flores, Lesser 
Sundas) demonstrated two separate human- 
mediated dispersals of domesticated S. scrofa 
(Pacific Clade pigs) and one involving an 
endemic Wallacean Sus species (S. celeben- 
sis). As S. celebensis is now endemic only to 
Sulawesi (assuming it was not naturally distrib¬ 
uted beyond Sulawesi in the early Holocene), 
its presence at Liang Bua on Flores as early as 
7000 bp (Van den Bergh et a/., 2009) suggests 
its early Holocene translocation by humans, 
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perhaps as a domestic animal as previously 
suggested by Groves (1983). Another possible 
explanation is that S. celebensis was trans¬ 
ported by hunter-gatherers to seed the island 
with game, just as the hunter-gatherers of the 
Levant introduced wild boar to Cyprus several 
thousand years before (Vigne et a/., 2009). 
Human-mediated translocation of other wild 
species in the region is evidenced by the intro¬ 
duction of the cuscus 20,000 ago into the 
Bismarck archipelago (White, 2004). 

Independent verification of the distinctive¬ 
ness of pigs with the Pacific mtDNA signature 
has been shown by morphometric analysis of 
the lower third molar (M 3 ) from the recent New 
Guinea and Flores pigs and archaeological pigs 
from the Flores site of Liang Bua using the 
same specimens analysed by Larson et a/. 
(2007a). Molar shape analysis of the archaeo¬ 
logical pigs from Holocene deposits at Liang 
Bua confirms the mtDNA evidence for the 
presence of two Sus taxa at the site: an endemic 
Wallacean type and several sharing the unique 
tooth morphotype of recent New Guinea pigs 
(Groves, 1983, 1995). In the light of the chrono- 
stratigraphic position of these remains (Van den 
Bergh et a/., 2009), these data show that two 
separate human introductions of Sus to the 
island of Bores occurred during the Holocene - 
an early one involving an endemic Sus species 
(S. celebensis) and a later one involving the 
arrival of domesticated S. scrofa most likely 
brought by migrating early farmers. 

Groves’ claims for an interspecific hybrid 
(S. s. uittatus x S. celebensis) as the basis to 
explain the anthropogenic status and morpho¬ 
logical divergence of New Guinea pigs (Groves, 
1983, 1995) is not supported by either the 
morphometric or genetic data, all of which 
confirm the maternal affinity of the New 
Guinea pigs as a form of S. scrofa. Because 
mtDNA only reveals the maternal heritage, it 
remains possible that male Sulawesi warty pigs 
(S. celebensis) were involved in the creation of 
Pacific Clade pigs found on New Guinea. From 
a morphological perspective, ongoing research 
demonstrates a congruent molar shape diver¬ 
gence in modern specimens from Flores, New 
Guinea and Near Oceania, from their mainland 
South-east Asian relatives, which suggests that 
their molar shape variability has retained a 
common population signature. Random drift 


and directional adaptation linked to insular syn¬ 
drome do not seem to have blurred the signal 
of their population history. The New Guinea 
S. scrofa morphotype is more likely the signal 
of its phylogenetic identity within the mtDNA 
Pacific clade, reflecting its domestic heritage. 

The recent combination of genetic and 
morphometric approaches has significantly 
advanced our understanding of pig domestica¬ 
tion in both South-east Asia and ISEA. These 
studies have conclusively demonstrated that 
peninsular South-east Asia was another centre 
for pig domestication, and that pigs domesti¬ 
cated here were then transported along an 
eastward trajectory into near and then eventu¬ 
ally remote Oceania. This dispersal is linked 
with the spread of early farmers during the 
Neolithic period, and endemic ISEA Sus taxa, 
although hunted, appear to have played little 
or no part in this process, although further 
research will address this issue (Larson et al., 
2007b; Cucchi et a/., 2009). 

India 

The evidence for multiple pig domestication 
centres, particularly for the Near East, Europe, 
China and South-east Asia, is reasonably 
robust. However, there are several other cen¬ 
tres of the Old World where evidence for pig 
domestication and/or introduction of domesti¬ 
cated pigs remains unresolved and where 
recent genetic data provide new evidence. 

Although Sus remains are a widespread 
component of zooarchaeological assemblages 
throughout India and Pakistan, they represent 
such a minor proportion of the domestic and 
wild fauna recovered (Chattopadyaya, 2002) 
that the possibility of an authochthonous 
domestication of S. scrofa in India has rarely 
been considered. Recent genetic data have, 
however, suggested the possibility of an inde¬ 
pendent domestication of Indian wild boar. 
Modern mtDNA data (Larson et al., 2005, 
2010; Tanaka et al., 2008) have demonstrated 
that domestic pigs in India and Bhutan possess 
unique and highly differentiated haplotypes 
identical to those found in Indian wild boar. 
This suggests that Indian domestic pigs were 
probably not derived either from migrating 
East Asian or South-east Asian populations or 
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from Near Eastern or European sources, but 
were the result of a process of local domestica¬ 
tion, though, again, the maternal inheritance 
of mtDNA means that a non-native paternal 
origin of these pigs remains a possibility. 

For the reasons outlined above, virtually 
nothing is known about the history of pig 
domestication in the Indian subcontinent, and 
a temporal context for when local pig domesti¬ 
cation might have occurred remains unex¬ 
plored. If this process did occur, it was most 
likely to have been during the Neolithic and 
linked possibly with the Harrapan culture of 
the North. Detailed biometric evidence for sep¬ 
arating wild and domesticated pigs in this 
region is not available (Meadow and Patel, 
2002; Fuller, 2006), nor are there (as yet) any 
ancient mtDNA sequences that would help to 
resolve the issue. Nevertheless, the application 
of combined ancient DNA and geometric mor¬ 
phometric approaches to further recent and 
fossil material from the region would help to 
resolve these questions. 


North Africa 

Although the majority of the African continent 
is inhabited by numerous suid species (see 
Chapter 1), S. scrofa populations have only 
been reported in North Africa along the 
Mediterranean. In Egypt, wild pigs were last 
reported in the late 19th century, and were 
present during the Christian era, reportedly 
from the Nile Delta, the Gizeh marshes the 
Fayum and Magrah oases, the Wadi el Natrun 
and the Wadi el Gharand (Manilus and Gautier, 
1999). Domestic pigs clearly made up a portion 
of the ancient Neolithic (predynastic) Egyptian 
diet, and the northern delta has been claimed to 
be a centre of intensive pig breeding (Hecker, 
1982). Whether these pigs were the descend¬ 
ants of a local domestication of an indigenous 
population of S. scrofa remains unanswered, 
though the limited fossil and archaeological 
record for wild boar in Egypt implies that they 
were absent during prehistory (Manilus and 
Gautier, 1999), suggesting that domestic pigs 
were introduced, probably from the Near East. 

More recent, though limited, genetic data 
(Larson et a/., 2005, 2007a) demonstrated 


that so-called wild boar samples from Morocco 
and Algeria possessed European haplotypes, 
while samples from Egypt and Sudan possessed 
Near Eastern haplotypes. These data could 
either indicate natural distributions of wild boar, 
extending from the Near East into north-east¬ 
ern Africa, and from south-western Europe into 
north-western Africa, or the presence of feral 
populations of domesticated Near Eastern wild 
boar introduced into North Africa. 

Additional unpublished sequences extracted 
from modern domestic pigs in Egypt (derived 
from a Coptic Christian butcher and claimed 
to be from traditional Egyptian varieties) have 
all produced common European domestic 
haplotypes. These sequences provide further 
evidence for a wholesale replacement of Near 
Eastern pig lineages by European domestic 
pigs that probably took place during later prehis¬ 
tory (Larson eta/., 2007a). The application of 
additional DNA and geometric morphometric 
approaches to these questions will help to 
resolve these outstanding issues. 


Future Directions and Conclusions 

The majority of what has been described above 
has been on the basis of combined zooarchae- 
ological and genetic data, although the genetic 
sequences have been primarily limited to inves¬ 
tigations of variability within the mitochondrial 
genome. This focus on mtDNA has both 
strengths and weaknesses. The benefit of 
employing a neutrally evolving marker that 
does not recombine is that the swift evolution 
of the genome makes it ideal to understand 
population differences within one species, dif¬ 
ferences that have traditionally not been visible 
at the level of the organism. Recent results 
from geometric morphometric analyses are 
now suggesting that genetic differentiation at 
the population level is also visible at the mor¬ 
phological level, and that these results can be 
contrasted with genetic sequences to recon¬ 
struct evolutionary histories of numerous 
domestic species. 

The primary drawback of the focus on 
mtDNA is the flip side of its lack of recombina¬ 
tion. The clonal reproductive nature of mito¬ 
chondria mean that mtDNA analysis within a 
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phylogenetic context is far simpler, but the 
resulting lack of paternal input into its inherit¬ 
ance pattern means that all mtDNA studies are 
limited to understanding the maternal evolu¬ 
tionary history. This is often not a problem, but 
because domestication in general, and pig 
domestication specifically, often involves a sig¬ 
nificant component of inter-population and 
even inter-species hybridization, as is evident in 
the European pig domestication and Island 
South-east Asian narrative above, studies that 
exclusively focus on mtDNA are unable to even 
ask questions related to the degree, timing and 
ramification of that hybridization. 

Two related approaches are changing 
that. First, newly emerging sequencing tech¬ 
nologies are making it ever easier and cheaper 
to generate large numbers of both genetic 
sequences and individual single nucleotide 
polymorphisms (SNFs), as discussed in other 
chapters of this book. Secondly, comparative 
and functional genomic approaches using 
these data mean that many of the genes that 
are responsible for the phenotypic, behavioural 
and physiological differences evident between 
wild boar and domestic pigs are beginning to 
be found. 

Though a number of other chapters in this 
book discuss the details of those genes, one 
specific locus is worth mentioning here. 
Following on from several publications that 
described the genetic variability in the pig 
melanocortin receptor 1 (MC1R) and corre¬ 
lated individual causative SNPs with specific 
coat colours, Fang et al. (2009) went one step 
further by typing more than 80 individuals 
from numerous locations across both Europe 
and China and, in doing so, identified 16 hap- 
lotypes. The most remarkable pattern that 
emerged from these data was not the fact that 
Asian wild boar harboured seven haplotypes 
while all European wild boar shared a single 
haplotype; because wild boar dispersed out of 
ISEA and into East Asia before continuing west 
towards Europe, they had spent a considerably 
longer period of time in East Asia, during which 
they had accumulated more genetic variation 
(Larson et a/., 2005; Megens et a/., 2008). 
What was unexpected was that, of all the five 
European and three Asian haplotypes found in 
domestic pigs, every one contained at least one 
non-synonymous mutation that altered the 


MC1R protein and, thus, the coat colour of the 
pigs. 

These results first demonstrated that there 
is a strong selective pressure against non¬ 
camouflage coat colours in nature, and that 
humans must have exerted a significant posi¬ 
tive selection for variation in order to generate 
large enough populations of all-black domestic 
pigs in order that the initial mutated sequences 
could act as the template on which additional 
mutations could be added. The fact that black 
domestic pigs exist in both Europe and China 
but achieve this coat colour through different 
mutations to the MC1R locus strongly suggests 
that the penchant for novelty is a universal 
trait, the consequences of which are evident in 
all our domestic animals. This kind of study 
goes a long way towards answering some of 
the basic questions that surround domestica¬ 
tion, including the debate as to whether the 
process acts on standing variation found in wild 
populations or assists the spread and fixation 
of mutations that occur after domestication has 
begun (Fang et al., 2009). 

What remains unknown, however, is when 
the domestic haplotypes began to appear and 
how quickly they spread through pig popula¬ 
tions and across Europe and Asia. By typing 
the causative mutations in ancient material, it 
may soon be possible to reconstruct not just 
the coat colour, but also many other character¬ 
istics, including developmental and behavioural 
traits of archaeological pig specimens, thus 
adding the missing temporal context. This type 
of approach has already been carried out in 
horses, the results of which demonstrated that 
coat colour variation increased substantially 
5000 years ago, at the same time that horse 
domestication becomes evident in the archaeo¬ 
logical record (Ludwig et al., 2009). 

Despite the progress that has been made 
over the past decade, a significant number of 
fundamental questions remain unanswered. 
The production of ‘improved breeds’ in the 
19th century through the intentional mating of 
Asian and European pigs may have been just 
the most recent instance of hybridization 
(intentional or otherwise) between different 
species and subspecies of S. scrofa. As genetic 
studies move beyond the mitochondrial 
genome, assessing how much of an effect 
hybridization has had on global pig populations 
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becomes feasible. By increasing the power and 
resolution of genetic data sets, we will begin to 
further unravel 10,000 years of selection and 
interbreeding, the results of which will have a 
large impact on what we know, and how we 
can tackle issues associated with commercial 
pig production, the conservation of wild boar 


populations and even the appearance and 
spread of pathogens that infect both pigs and 
people. Thus, the combination of the new gen¬ 
eration of genetic and morphological tech¬ 
niques has the power to inform the past, 
present and future of our long-standing rela¬ 
tionship with pigs. 
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Introduction 

Genetic studies of coat colour variation in pigs 
were initiated soon after Mendel’s laws of inher¬ 
itance were rediscovered in around 1900. 
Spillman (1906) first described the dominant 
inheritance of white colour, and this was fol¬ 
lowed by a study of the inheritance of the white 
belt phenotype in the following year (Spillman, 
1907). Sewall Wright had already published the 
first review on the inheritance of coat colour 
variation in pigs in 1918 (Wright, 1918). This 
was followed by a number of studies that estab¬ 
lished the basic principles of pig coat colour 
genetics. H.O. Hetzer made a major contribu¬ 
tion to this field when he published a series of 
papers during the 1940s and 1950s analysing 
the segregation of coat colour in a number of 
crossbreeding experiments (Hetzer, 1945a,b,c; 
reviewed by Legault, 1998). The classical 
genetics of coat colour variation in pigs has 
been reviewed by several authors (Ollivier and 
Sellier, 1982; Legault, 1998). 

Before the era of molecular genetics, it 
was proposed that a set of major coat colour 

38 


47 

48 


loci designated A, B, C, D, E, P and S 
(Table 3.1) controlled coat colour variation in 
mammals. Castle and Little (1909) were the 
first to introduce the concept of a set of major 
genetic factors affecting coat colour, and this 
was further developed by Sewall Wright in a 
series of seminal papers (e.g. Wright 1917, 
1918). Subsequent molecular studies have to a 
large extent confirmed previous assumptions 
of gene homologies between species based on 
strikingly similar effects on the coat colour phe¬ 
notype (Table 3.1). However, further genetic 
studies have revealed that mutations in a fairly 
large number of genes may affect coat colour. 
In the mouse, for example, 159 genes affecting 
pigmentation have been described so far 
(Montoliu et a/., 2009). 

The first molecular study on pig coat 
colour genetics was published in 1992, when 
close linkage between the gene for Dominant 
white colour and the ALB and PDGFRA genes 
on pig chromosome 8 was reported (Johansson 
et a/., 1992). Since then, more than 20 molec¬ 
ular studies on pig coat colour genetics have 
been published and three major coat colour 

©CAB International 2011 . The Genetics of the Pig, 
2nd Edn (eds M.F. Rothschild and A. Ruvinsky) 
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Table 3.1. The classical coat colour loci in mammals and their corresponding genes as identified 
in the mouse. 



Locus 


Gene 


Symbol 

Name 

Symbol 3 

Name 

Effect on pigmentation 

A 

Agouti 

As ip 

Agouti signalling 
protein 

Relative distribution of eumelanin and 
pheomelanin 

B 

Brown 

Tyrpl 

Tyrosinase-related 
protein 1 

Changes black eumelanin to brown 
eumelanin 

C 

Albino 

Tyr 

Tyrosinase 

General lack of pigmentation in hair, skin 
and eye 

D 

Dilution 

Myo5a b 

Myosin type Va 

Dilution of pigment 

E 

Extension 

Mclr 

Melanocortin 1 
receptor 

Relative distribution of eumelanin and 
pheomelanin 

P 

Pink-eyed 

dilution 

Oca2 

Oculocutaneous 
albinism II 

Diluted pigmentation in hair, skin and eye 

S 

Spotting 

Edn rtf 

Endothelin receptor 
type B 

White spotting due to defect in 
melanocyte migration 


a ln mouse nomenclature, only the first letter in a locus symbol is capitalized, whereas in humans and pigs locus symbols 
are written with all capitals. 

b Mutations in other genes may give very similar dilution of coat colour. 

c Mutations in other genes give similar white spotting phenotypes. In the pig, the KIT gene is the major locus for white 
colour and white spotting. 


loci in the pig have been studied at the molecu¬ 
lar level: the Dominant white (I) locus, the 
Extension (E) locus and the Agouti (A) locus. 
These studies have shown that most of the 
coat colour diversity present among major pig 
breeds, at least those of European origin, is 
explained by genetic variation at the Dominant 
white and Extension loci. The major aim of 
this chapter is therefore to review these molec¬ 
ular studies and their implications for our 
understanding of coat colour variation in pigs. 

The Dominant White (I) Locus - 
Extensive Allelic Diversity 

Dominant white is one of the most widespread 
coat colour variants in European domestic 
pigs, and major breeds such as Yorkshire/ 
Large White and Landrace are assumed to be 
fixed for the Dominant white (I) allele, although 
this is not always the case (see below). Some 
pig meat products are marketed including the 
skin, and in many markets there is a strong 
consumer preference for products with white 
skin. This, at least in part, relates to the per¬ 
ception that the dark hair roots that are present 


after the dehairing of coloured breeds indicate 
a hygiene (mould) problem. Therefore, the 
common practice in pig breeding is often to 
use a dam that is homozygous for the Dominant 
white allele to ensure that the production ani¬ 
mals have white skin. In skin-on markets, such 
animals require less processing post slaughter, 
representing significant cost savings, especially 
from yield loss (Cannon et al., 1996). 

Wentworth and Lush (1923) proposed 
that Dominant white colour is caused by a sin¬ 
gle dominant gene, and this was later confirmed 
by Hetzer in his analysis of crosses between col¬ 
oured and white breeds (Hetzer, 1945a,b,c). 
Hetzer also proposed the allele designation I 
for inhibition of colour. The molecular charac¬ 
terization of the Dominant white locus was 
initiated by the assignment of this locus to the 
centromeric region of chromosome 8 based on 
segregation analysis of an intercross between 
the European wild boar and Large White pigs 
(Johansson et ah, 1992). The segregation 
analysis of the white colour in this cross was 
in accordance with expectation, i.e. a 3:1 
segregation between white and coloured 
progeny in the F 2 generation, but with one 
exception (Plate 1A); one of the Large White 
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founder animals carried an alternative allele, 
which was named Patch (J p ), and F x animals 
that were heterozygous (I P /i) showed a white 
spotted phenotype in contrast to the pure white 
phenotype observed for the 1/i heterozygotes. 
The segregation data in the F 2 generation con¬ 
firmed that the Fateh phenotype was caused by 
an alternative allele at the Dominant white 
locus (Johansson et ah, 1992). 

The assignment of the Dominant white 
locus to the centromeric region of chromosome 
8 immediately revealed the KIT gene as a 
strong positional candidate locus (Johansson 
et ah, 1992). A few years earlier the murine 
homologue, Kit , had been identified as the 
gene underlying Dominant white spotting 
(Chabot et ah, 1988; Geissler et at. , 1988). 
Kit encodes a tyrosine kinase receptor that has 
a crucial role for the migration and survival of 
melanocyte precursor cells originating from the 
neural crest, as well as for haematopoiesis and 
germ cell development. The Kit ligand (KITL) 
has previously been denoted Mast cell growth 
factor (MGF), and mutations in the gene encod¬ 
ing KITL in mice also cause pleiotropic effects 
on pigmentation, haematopoiesis and fertility 
(Rothschild et ah, 2003). The porcine KITL 
gene has recently been characterized, but no 
significant role in determining coat colour vari¬ 
ation has yet been revealed (Okumura et ah, 
2006; Fladjiconstantouras et ah, 2008). 

Molecular studies revealed that both the 
Dominant white (I) and the Patch (I p ) alleles 
are associated with a duplication of the KIT 
locus (Johansson Moller et al. , 1996). A small 
deletion was detected in intron 18 and segrega¬ 
tion analysis showed that both the deleted form 
and the normal form were transmitted together 
with the I and I p alleles. In order to determine 
the size of the duplication, a BAC contig (a set 
of overlapping bacterial artificial chromosome 
clones) was established across the Dominant 
white/KIT locus, and this revealed a 450kb 
duplication that encompasses all coding exons, 
about 150 kb of the upstream region and about 
200 kb of the downstream region (Giuffra et al ., 
2002). The sequences in the vicinity of the 
duplication break-points indicated that the 
duplication had occurred as a result of a 
recombination event between LINE sequences 
flanking the KIT locus. Based on comparisons 
with the characterization of the functional 


consequences of similar rearrangements associ¬ 
ated with Kit mutations in mice, it was proposed 
that the duplication acts as a regulatory muta¬ 
tion (Giuffra et al. , 2002). The duplicated KIT 
copy may show dysregulated expression 
because it is physically separated from some of 
the regulatory sequences located at a far 
distance from the coding sequence. 

The observation that the Dominant white 
and Patch alleles carried the same duplication 
implied that at least one other causative muta¬ 
tion must exist in order to explain the pheno¬ 
typic difference associated with these alleles. 
An examination of KIT transcripts in white 
blood cells revealed the presence of two splice 
forms in pigs carrying the Dominant white 
allele, one normal and one variant form lacking 
exon 17 (Marklund et ah, 1998). Sequencing 
of exon 17 and the flanking introns revealed a 
single base substitution in the first nucleotide of 
intron 17. It was concluded that this must be 
the mutation leading to the skipping of exon 17 
because the splice junctions are essential for a 
normal splice pattern (Marklund et al. , 1998). 
Further analysis revealed that the Dominant 
white allele carries one copy of the KIT gene 
with a normal intron 17 sequence, while the 
second KIT copy has a mutant form of intron 
17 consistent with the transcript pattern; in 
contrast, in Patch both KIT copies have the 
normal form of intron 17. Exon 17 codes for a 
part of the tyrosine kinase domain that is essen¬ 
tial for KIT signalling. The data therefore indi¬ 
cated that the mutant KIT sequence, the one 
with the splice mutation, encodes a receptor 
with normal ligand binding but with no or dras¬ 
tically reduced signalling capacity. Thus, the 
combined effect of the duplication (most likely a 
regulatory mutation, as discussed above) and 
the splice mutation explains the unique features 
of the Dominant white allele. It has a more 
drastic effect on pigmentation in the hetero¬ 
zygous condition than any mouse Kit mutation, 
but homozygotes are still fully viable in contrast 
to many mouse Kit alleles that are lethal or 
semi-lethal in the homozygous condition. The 
reason why the Dominant white allele is fully 
viable despite a major effect on pigmentation is 
that the presence of at least one functional KIT 
copy per chromosome provides sufficient KIT 
signalling to avoid the severe pleiotropic effects 
on haematopoiesis and fertility observed in 
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animals lacking KIT function completely. 
However, the Dominant white allele is associ¬ 
ated with a mild reduction in the number of red 
and white blood cells (Marklund et a/., 1998; 
Johansson et a/., 2005). Mild negative effects 
on fertility have not been documented but it 
cannot be excluded that such effects exist. 

The presence of two causative mutations 
at the KIT locus suggested an evolutionary sce¬ 
nario where the duplication occurred first and 
resulted in a partially white coat colour. The 
occurrence of the splice mutation then enhanced 
the white phenotype. This is the most plausible 
explanation because comparative mouse data 
show that loss-of-function mutations at the Kit 
locus give a mild coat colour phenotype in the 
heterozygous condition but they are lethal in 
the homozygous condition, whereas regulatory 
mutations can be both fully dominant and fully 
viable, even in homozygotes. 

The relatively complex structure of the KIT 
locus, including the duplication and the splice 
mutation, and the interest in accurately scoring 
the Dominant white genotype to ensure 
homozygosity in animals used for breeding 
called for a very accurate genotyping method. 
Pyrosequencing was employed to establish such 
a method (Pielberg et ah, 2002, 2003). An 
alternative method based on the oligonucleo¬ 
tide ligation assay was subsequently described 
(Seo et a/., 2004). These studies revealed 
extensive haplotype diversity at the KIT locus 
among commercial pig populations assumed to 
be homozygous for the Dominant white allele. 
Haplotypes with one, two or three copies were 
found, and the splice mutation occurred in 


none, one or two of the copies (Table 3.2). 
Furthermore, an allele carrying a single copy 
but with the splice mutation was found, and 
denoted I L , because it is expected to be lethal in 
the homozygous condition owing to a severe 
defect in KIT signalling (Pielberg et a/., 2003). 
The extensive genetic heterogeneity at the KIT 
locus in pigs strongly suggested that it is gener¬ 
ated by unequal crossing over during meioses, 
but the rate of unequal crossing over at this 
locus has not yet been established. This explains 
why many white lines still segregate for 
Dominant white despite breeders having 
attempted for more than 200 years to breed 
the white allele to homozygosity. The results 
also imply that white pigs that become feral will 
return to wild type at the KIT locus because 
single-copy alleles lacking the splice mutation 
will be generated by unequal crossing over and 
then favoured by natural selection as pigmenta¬ 
tion provides camouflage and protection against 
the damaging effects of UV light. 

The Belt allele was assigned to the KIT locus 
on the basis of classical segregation analysis using 
an intercross between Hampshire (belted) and 
Pietrain (non-belted) pigs (Giuffra et ah, 1999). 
The entire coding sequence of KIT associated 
with Belt had been determined in a previous 
study and did not reveal any putative causative 
mutation (Marklund et ah, 1998). These results 
strongly suggest that the Belt phenotype is caused 
by a regulatory mutation, and it is well established 
that some of the regulatory elements controlling 
KIT expression during development are located 
more than 100 kb away from the coding sequence 
(Berrozpe et ah, 1999). 


Table 3.2. Summary of the characteristic features of alleles at the Dominant white (l)/KIT locus in pigs. 


Allele 

No. of KIT copies 

No. of normal copies 

No. of copies with 
splice mutation 

Name 

Designation 

Wild-type 

• 

/ 

1 

1 

0 

Belt 

/Be 

1 

1 

0 

Roam 

jRn 

1 

1 

0 

Patch 

l P 

2 

2 

0 

Dominant white 

I 1 

2 

1 

1 

Dominant white 

I 2 

3 

2 

1 

Dominant white 

I 3 

3 

1 

2 

Lethal 

l L 

1 

0 

1 


a The existence of this allele needs to be confirmed once the causative mutation(s) for the Belt allele has been identified. 
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A fourth KIT allele, in addition to wild- 
type, Patch and Dominant white, was detected 
in the intercross between European wild boar 
and Large White pigs (Pielberg et al., 2002). 
Like the Belt allele, this allele was associated 
with a single KIT copy and no splice mutation, 
but it showed a roan phenotype (white and col¬ 
oured hairs intermingled) rather than a belted 
phenotype. This represents a tentative addi¬ 
tional KIT allele denoted Roan (I Rn ) but at 
present it cannot be formally excluded that 
Belt and Roan represent the same allele show¬ 
ing different phenotypic expressions on differ¬ 
ent genetic backgrounds. This can be resolved 
easily once the causative mutation for the Belt 
phenotype has been identified. The alleles 
described so far at the Dominant white locus 
are summarized in Table 3.2. 


The Extension (E) Locus - 
Contrasting Selection Pressures 
in Wild and Domestic Pigs 

Classical genetics established an allelic series 
comprising four alleles at the Extension locus 
in pigs (Legault, 1998). These are, with this 
order of dominance, Dominant black (E D ), 
wild-type (E + ), black spotting (E p ) and reces¬ 
sive red (e). In the F 2 generation of a cross 
between European wild boars (E + /E + ) and 
Large White pigs segregating both at the 
Dominant white and Extension loci, wild type 
colour (E + /-) and black spotting (E p /E p ) segre¬ 
gated in accordance with the expected 3:1 
ratio among the progeny that were homozygous 
i/i at the Dominant white locus, and E was 
assigned to pig chromosome 6 by linkage 
analysis (Mariani et al. , 1996). The MC1R 
gene encoding the melanocortin 1 receptor 
was identified as the obvious candidate gene 
because an earlier study had shown that the 
Extension locus corresponds to the Mcl r gene 
in mice (Robbins et al. , 1993; Table 3.1). 
MC1R signalling determines the switch 
between production of black/brown eumelanin 
and the production of red/yellow pheomelanin. 
MC1R signalling is activated by the binding of 
the ligand melanocyte-stimulating hormone 
(MSH) to the receptor leading to production 
of black pigment, while the agouti signalling 


protein (ASIP) encoded by the Agouti locus 
inhibits MC1R signalling and thereby favours 
the production of red pigment. A typical wild- 
type coat colour in mammals, like the one in 
the wild boar, is composed of a mixture of 
eumelanin and pheomelanin, and individual 
hairs may be striped black and yellow owing to 
the temporal activation of ASIP production 
during the hair cycle. Mutations causing 
constitutive activation of MC1R signalling 
cause dominant black colour, whereas loss-of- 
function mutations are associated with recessive 
red colour. 

In an initial analysis, the entire MC1R cod¬ 
ing sequence (except the first 120 nucleotides) 
was determined in pigs representing different 
Extension alleles (Kijas et al., 1998). This 
analysis immediately confirmed that the 
Extension locus also harbours the MC1R gene 
in pigs because a number of missense muta¬ 
tions showing complete association to the dif¬ 
ferent Extension alleles were revealed. Two 
different sequences were associated with 
Dominant black colour, and it was apparent 
that one, denoted E D1 , had an Asian origin, 
while the other, denoted E D2 , clearly had a 
European origin. The MC1R sequence associ¬ 
ated with the latter allele was identical to the 
one found in the European wild boar except 
for a single non-conservative missense muta¬ 
tion, Aspl24Asn (Table 3.3), which is almost 
certainly the causative mutation (Kijas et al., 
1998). The Asian Dominant black allele was 
associated with two missense mutations, 
Val95Met and Leul02Pro (Table 3.3), and it is 
assumed that the latter is the prime causative 
mutation because the same missense mutation 
in the corresponding position causes dominant 
black colour in cattle (Klungland et al., 1995). 
The e allele causing recessive red colour, for 
instance in Duroc pigs, is also associated with 
two missense mutations, Alal64Val and 
Ala243Thr (Table 3.3), and it has not yet been 
established if one of these or the combined 
effect of the two is the cause of red colour. 

A surprising finding in this initial study 
was that the MC1R sequence of the E p allele 
for black spotting was identical to the one 
associated with the Dominant black allele 
(E D2 ) of European origin (Kijas et al., 1998). 
However, a subsequent study in which 
the entire coding sequence of MC1R was 



Table 3.3. Sequence alignment of pig MC1R/E alleles (— indicates identity with the master sequence). 











Codon 









MC1 R a /E 

4 

17 

21 

22 

95 

102 

117 

121 

122 


124 

164 

166 

243 

301 


0101/ E + 

C T T 

G C G 

G C C 

C C C 

G T G 

C T G 

C A G 

A A T 

G T C 

G 

A C 

G C G 

C G G 

G C G 

T A 

C 


Leu 

Ala 

Ala 

Pro 

Val 

Leu 

Gin 

Asn 

Val 


Asp 

Ala 

Arg 

Ala 

Tyr 


0102/ E + 

. 

. 

. 

. 

. 

. 

. 

. . C 

. 

■ 

■ 

. 

. 

. 

• 

■ 

0103/ E + 

. . C 

■ 

. 

. 

. 

. 

. 

. . C 

. 

■ 

■ 

. 

. 

. 

■ 

■ 

0104/ E + 

. 

. . A 

. 

■ 

. 

. 

. 

. . C 

. 

■ 

. T 

. 

. 

. . A 

• 

T 

0105/ E + 

. 

. . A 

. 

. 

. 

. 

. . A 

. . C 

• 

■ 

■ 

. 

. 

. . A 

■ 

T 

0201/ E D1 

. 

. . A 

. 

. 

A . . 

Met 

A . . 

. C . 

Pro 

. C . 

. 

. . C 

. 

■ 

■ 

. 

. 

. . A 

• 

■ 

0202/ E D1 


. . A 




. . C 

A . . 





. . A 





— 

— 

— 

Met 

Pro 

— 


lie 


— 

— 

— 

— 

— 


0203/ E D1 

. 

. . A 

. 

. 

A . . 

Met 

. C . 

Pro 

. 

. . C 

. 

■ 

. T 

. 

. 

. . A 

■ 

■ 

0301/ E 02 

. 

. 

. 

. 

. 

. 

. 

A 

Asn 

. 

. 

. 

• 

■ 

0401/ e 

. 

. 

. 

. 

. 

. 

. 

. 

. 

• 

. T . 

Val 

. 

A . . 

Thr 

• 

■ 

0501/ E p 




+CC 






A 






— 

— 

— 

FS b 

— 

— 

— 


— 


Asn 

— 

— 

— 

— 


0502/ E p 

. . . 

. . . 

A . . 

+CC 

. . . 

. . . 

. . . 

. . . 

. . . 

A 

. . 

. . . 

. . . 

. . . 

. . 

. 


— 

— 

Thr 

FS b 

— 

— 

— 


— 


Asn 

— 

— 

— 

— 


0503/ E p 

. 

■ 

. 

+CC 

. 

. 

. 

. 

. 

A 

■ 

. 

T . . 

. 

■ 

■ 



— 

— 

FS b 

— 

— 

— 


— 


Asn 

— 

Trp 

— 

— 



a The nomenclature for MC1R alleles is based on the system proposed by Fang etal. (2009). 
b FS = frameshift, sequence out of frame after codon 22. 
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determined revealed that the E p allele also 
carries an insertion of two C nucleotides at 
codon 22, in addition to the missense muta¬ 
tion Aspl24Asn (Kijas et a/., 2001; Table 
3.3). The two-base-pair insertion constitutes a 
frameshift mutation and is thus expected to be 
a complete loss of function that should cause 
recessive red colour according to the current 
knowledge of MC1R function. In fact, some of 
the pigs that are homozygous for this allele 
show a uniform red colour, although the great 
majority shows a black spotting phenotype 
(Plate 1A); how can this be explained? The 
fact that the CC dinucleotide insertion occurs 
in a stretch of six C nucleotides, creating a 
short eight-base-pair mononucleotide repeat, 
implied that this mutation may be somatically 
unstable during melanocyte development. In 
order to test this hypothesis, reversed tran¬ 
scriptase (RT)-PCR analysis was performed 
from skin with black spots as well as skin with 
red spots; the results showed that somatic 
reversions had occurred in areas with black 
pigmentation so that in all cases the 8C had 
shrunk to 6C, which restores the reading 
frame, and the Dominant black mutation at 
residue 124 becomes reactivated! 

The E p /E p genotype has a very variable 
phenotypic expression and may be associated 
with uniform red colour, black spotting on a 
red background colour or black spotting on a 
white background. One of the most extreme 
forms is found in Berkshire pigs, which show 
a uniform black colour but with white feet, 
nose and tail. Sewall Wright (1918) had 
already proposed that the black colour of 
Berkshire pigs constitutes an extreme form of 
black spotting due to the action of modifying 
genes. The presence and extension of black 
spots in E p /E p pigs can be affected both by 
factors/mutations that enhance the frequency 
of somatic mutations and by factors/muta¬ 
tions that stimulate melanocyte migration sub¬ 
sequent to the occurrence of a somatic 
mutation. A very intriguing observation is that 
the black spots are consistently larger on a 
white background than on a red background 
colour (Plate 1A). A possible explanation for 
this could be that the red areas contain 
melanocytes producing red pheomelanin, 
while the white areas lack melanocytes com¬ 
pletely, which allows the revertant melano¬ 


cytes to expand more and form larger black 
spots. However, no functional data support¬ 
ing this suggestion are yet available and it is 
still not known whether the white areas lack 
melanocytes or contain melanocytes that are 
unable to produce pigment. 

It can be concluded that pigs with white 
background colour and black spots have a geno¬ 
type that does not support pigment production 
in the absence of MC1R signalling. The genetic 
difference between pigs with black spotting on a 
red or white background colour has not yet been 
resolved. These two phenotypes segregated in 
an intercross between European wild boars and 
Large White domestic pigs (Plate 1A) and a 
careful examination of the segregation data 
showed that this difference is not controlled by 
the MC1R locus as full siblings sharing the same 
E p alleles may differ in background colour 
(Mariani et a/., 1996). The ratio of the two phe¬ 
notypes did not fit any simple Mendelian inherit¬ 
ance ratio and no linkage was detected. This 
implies either that the phenotypes have a more 
complex genetic background or that one or 
both of the parental strains are not fixed at a 
putative locus underlying this trait. 

A significant spin-off finding of the char¬ 
acterization of the MC1R locus was that it 
clearly indicated a substantial genetic distance 
between Asian and European pigs because 
the Asian E m allele differed from the E + allele 
and all other E alleles of European origin by 
two synonymous substitutions, in addition to 
the two non-synonymous substitutions. The 
implication was that Asian domestic pigs 
originate from Asian wild boars whereas 
European domestic pigs originate from 
European wild boars. Furthermore, the pres¬ 
ence of the Asian allele for dominant black 
colour in some European breeds most likely 
reflects the documented introgression of 
Asian pigs into European stocks primarily 
during the 18th and 19th centuries (Kijas 
et a/., 1998). This was subsequently 

confirmed in follow-up studies including the 
analysis of mtDNA (mitochondrial) and 
nuclear genes, which indicated a considerable 
genetic distance between Asian and European 
wild boars (Giuffra et a/., 2000; Kijas and 
Andersson, 2001). This finding has been 
further corroborated in many subsequent 
studies (see Chapter 2). 
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A comprehensive screening of genetic 
diversity at the MC1R locus was accom¬ 
plished by sequencing the entire coding 
sequence from pigs representing 31 popula¬ 
tions of European pigs, 15 breeds of Chinese 
pigs, and European and Asian wild boars 
(Fang et ai, 2009). This resulted in several 
interesting findings. First, the E + wild-type 
allele was only present in one of the European 
breeds, the Mangalitsa. Interestingly, this is 
one of the very few breeds in which the pig¬ 
lets are striped, which is a characteristic fea¬ 
ture of wild boar piglets. This implies that 
MC1R is a domestication locus in pigs as 
almost all breeds are fixed for a mutant form 
of MC1R. Furthermore, it indicates that an 
MC1R wild-type sequence is required to pro¬ 
duce the striping pattern in piglets. The 
authors have also seen dorsoventral striping 
in other crosses, suggesting that there are 
further alleles segregating in some breeds. 
Secondly, a total of seven nucleotide substi¬ 
tutions was found among European and 
Asian wild boars, and all were synonymous, 
i.e. they did not change the amino acid 
sequence (Plate IB). In sharp contrast, nine 
out of ten mutations only found among 
domestic pigs affected the coding sequence, 
and some alleles differed by up to three non- 
synonymous substitutions from their closest 
wild-type allele (Plate IB). The ratio of the 
relative frequency of non-synonymous (d N ) 
and synonymous (d s ) substitutions was as 
high as 23.5 among domestic pigs, whereas 
the d N /d s ratio was 0 among the wild boars. 
This provided evidence for purifying selec¬ 
tion in the wild, most likely to maintain cam¬ 
ouflage colour, but strong directional selection 
against camouflage at the farm. Possible 
explanations for this selection in domestic 
pigs are that: (i) a non-camouflaged coat col¬ 
our may have facilitated early animal hus¬ 
bandry; (ii) coat colour variants have been 
used as markers to easily distinguish domes¬ 
ticated forms from wild forms; or (iii) simply 
because the phenotypic diversity created by 
coat colour variants has been appealing to 
humans as we can appreciate fashion. 

This study of MC1R diversity among pig 
populations has important implications. It 
demonstrates how strong the selection pres¬ 
sure can be in domestic animals and that the 


history of domestication is sufficiently long for 
some alleles to represent the accumulation of 
several consecutive mutations with phenotypic 
effects. The E p allele is an example of an allele 
carrying two causative mutations, the missense 
mutation (Aspl24Asn) and the two-base-pair 
insertion at codon 22. Domestication is very 
recent from an evolutionary perspective - 
within the last 10,000 years - which means 
that there has not been sufficient time to accu¬ 
mulate many synonymous substitutions. This is 
well illustrated by the fact that not a single syn¬ 
onymous substitution in the MC1R gene has 
yet been verified among European wild boars 
and European domestic pigs. This finding is 
consistent with other data indicating that the 
European wild boar went through a population 
bottle-neck before domestication (Fang and 
Andersson, 2006; Fang et a/., 2009). 

This MC1R study provided conclusive evi¬ 
dence that intentional human selection has 
acted directly to change the coat colour. The 
very strong selection signature refutes the pos¬ 
sibility that the coat colour has changed simply 
as a result of relaxed purifying selection subse¬ 
quent to domestication. Furthermore, MC1R 
has no known function outside the pigment 
cell, making it exceedingly unlikely that the 
coat colour has been altered as a by-product of 
selection for another trait, for instance behav¬ 
iour, as suggested in some other studies (see 
Fang et a/., 2009). 


The Agouti (A) Locus - Still Poorly 
Characterized at the Molecular Level 

Agouti is one of the classical coat colour loci 
in mammals and it determines, together with 
the Extension locus, the relative proportion of 
red/yellow pheomelanin and brown /black 
eumelanin in pigmentation (Table 3.1). The 
correct gene designation is now AS/P, as this 
gene encodes agouti signalling protein (ASIP), 
which is an MC1R antagonist that interferes 
with the interaction between MSH and MC1R. 
Therefore, loss-of-function alleles at the ASIP 
locus cause recessive black colour, because 
MSH is free to bind MC1R throughout the 
hair cycle, whereas ASIP overexpression leads 
to dominant yellow colour, because ASIP 
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blocks MC1R. The ASIP locus in pigs has not 
been characterized to the same degree as the 
KIT and MC1R loci, but Drogemuller et ai 
(2006) have reported that the black-and-tan 
phenotype segregating in a Mangalitsa x 
Pietrain intercross is determined by an allele at 
the ASIP locus, and proposed that the allele is 
designated ai No sequence difference in the 
coding region was detected between the a £ and 
the A + wild-type alleles, but five ASIP tran¬ 
scripts showed differential expression between 
genotypes at this locus in dorsoventral skin, 
strongly implicating that a 1 is caused by a regu¬ 
latory mutation. It is well established that the 
expression of ASIP has a complex regulation 
and is affected by multiple regulatory elements 
spread over a fairly large genomic region 
(Vrieling et ai, 1994). 


Other Loci 

As well as the three loci described in detail in 
the present review, Legault (1998) reviews 
data suggesting the existence of genetic varia¬ 
tion at five additional loci that may affect coat 
colour variation in pigs: the Brown (B) locus, 
the Albino (C) locus, the Dilution (D) locus, 
the Red-eye (R) locus and the White head (He) 
locus. The three former loci are assumed to 
correspond to the classical B, C and D loci in 
other mammals (Table 3.1). In fact, a recent 
study demonstrated that brown colour in 
Chinese Tibetan, Kele and Dahe pigs is caused 
by a six-base-pair deletion in TYRP1 , which 
encodes tyrosinase-related protein 1 (Lusheng 
Huang, personal communication). No conclu¬ 
sive evidence for genetic variation at the 
Albino (C) locus has yet been reported, but 
Searle (1968) proposed that dirty white colour 
in Mangalitsa pigs maybe due to an allele at 
this locus. Similarly, no conclusive evidence 
exists for genetic variation at the D locus in 
pigs (Legault, 1998), which corresponds to 
the MY05A gene, which encodes for Myosin 
type Va (Table 3.1). Roberts and Krider (1949) 
identified an autosomal recessive allele (r) 
causing red eye pigmentation and dilution of 
coat colour in Hampshire pigs. Fernandez et 
al. (2003, 2005) investigated genetic varia¬ 
tion in the OCA2 gene, which encodes for 


oculocutaneous albinism II and corresponds to 
the classical pink-eyed dilution (P) locus (Table 
3.1); they found that this locus had an additive 
polygenic effect on colour intensity in red 
Iberian pigs. Finally, Legault (1998) reviewed 
the evidence for a White head (He) allele 
showing dominant inheritance. An obvious 
candidate gene for this phenotype is the KIT 
locus, and it cannot be excluded that the phe¬ 
notype reflects the expression of one of the 
many KIT alleles (see Table 3.2) on a certain 
genetic background. 

In addition, linkage studies such as that of 
Hirooka et ai (2002) point to a number of 
‘modifying loci’ that affect coat colour. However, 
these loci need to be confirmed by more detailed 
molecular studies. For example, a candidate for 
one of these modifying loci is the KIT ligand 
(see above in relation to Dominant white/KTT), 
although subsequent analysis of this locus failed 
to find any association with variation in coat 
colour (Hadjiconstantouras et ai, 2008). This 
need is also well illustrated by the fact that the 
Belt phenotype, for instance, that is exhibited 
by Hampshire pigs was assumed to represent 
an allele at a separate locus until it was shown 
that it is controlled by an allele at the Dominant 
white locus (Giuffra et ai, 1999). Similarly, 
Bushnell (1943) reported loose genetic linkage 
between the Belt and Extension loci, but this 
can now be refuted because subsequent molec¬ 
ular studies have shown that the Dominant 
white (Belt) and Extension loci are located on 
different chromosomes - chromosomes 8 and 
6, respectively. Thus, access to molecular data 
provides an excellent opportunity to test 
hypotheses of inheritance patterns of coat col¬ 
our variation. 


Coat Colour Variants 
as Breed-specific 
Genetic Markers 

The identification of the two major loci for coat 
colour, Dominant white and Extension , and the 
characterization of variation in these genes across 
breeds has provided a useful tool for the verifica¬ 
tion of breeds and the products derived from 
these breeds (e.g. Alderson and Plastow, 2004; 
also reviewed more generally by Dalvit et ai, 
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2007). This utility is of course based on the selec¬ 
tion of physical uniformity of type (i.e. different 
breed-specific morphological characters such as 
ear type, head shape and coat colour) as part of 
setting specific breed standards during the devel¬ 
opment of a breed (Alderson and Flastow, 2004). 
Dalvit et al. (2007) indicated that the use of col¬ 
our loci is ‘deterministic’ and therefore more 
straightforward than the use of ‘probabilistic 
approaches’. This has proven to be very effective 
in certain situations where the question of origin 
can be framed to match the information available 
for a breed, the likely source of non-conforming 
product, and the probability of finding rare but 
uncharacterized breeds that negate the conclu¬ 
sion. The need for such a tool increases as prod¬ 
ucts become specified to specific regions or types 
of production that include a genetic element. 
Jambon Iberico in Spain and Kuro Buta in Japan 
are examples of pork products that specify breed 
type, and Dalvit and colleagues (2007) state that 
‘breed traceability is a means to defend and valor¬ 
ize particular food products’. A number of differ¬ 
ent studies have investigated this application and 
the ‘keys’ derived from them continue to be 
applied in several different situations. For exam¬ 
ple, Carrion et a I. (2003) discuss examples for 
wild boar and Iberian pigs as well as Kuro Buta 
(Berkshire) in Japan, while Alderson and Flastow 
(2004) give examples for Tamworth as well as 
Berkshire. Other examples include replacing test 
mating to remove ‘red alleles’ introduced into the 
Hampshire breed in the USA and the authentica¬ 
tion of Iberian pork products using MC1R and 
other markers such as a synonymous substitution 
at the OCA2 locus (Fernandez et al., 2004). 


Some Remaining Questions 

to Resolve 

The coat colour variation present among 
European pig breeds is primarily controlled by 
alleles at the Dominant white/KIT and 
Extension/MCIR loci. The causative muta¬ 
tions for the Belt and Roan alleles at the 
Dominant white/KIT locus and for the a 1 
allele at the ASIP locus remain to be identi¬ 
fied. It is expected that this can be accom¬ 
plished in the not too distant future because 
next-generation sequencing now makes it 


possible to very efficiently and cheaply scan 
the genome for sequence variants associated 
with phenotypic traits (Rubin et al., 2010). 
Another fascinating question to resolve is the 
genetic basis for the difference in background 
colour (white or red) in black-spotted pigs, as 
discussed above. It is also possible that addi¬ 
tional coat colour loci contribute to coat col¬ 
our variation within or between European pig 
breeds, as suggested by classical segregation 
analysis (Legault, 1998), for example pigs 
derived from a cross between Berkshire and 
Hampshire breeds show remarkable colour 
patterns that would not be expected from sim¬ 
ple models, but these need to be verified by 
further molecular work. 

It is also interesting that there has been 
independent selection for black colour and for 
different types of white-spotting phenotypes 
in European and Asian pigs. It is clear that 
most Asian - or at least Chinese - pig breeds 
are fixed for the Asian-specific Dominant 
black allele at the Extension/MCI R locus 
(Fang et al., 2009), but the genetic basis for 
white spotting and white colour in Asian 
breeds remains to be resolved. Evidence exists 
for a non-dominant white coat colour in Asian 
breeds, and the authors have seen evidence 
for similar loci in European pigs. It has been 
reported that neither a belt-like phenotype 
nor the white colour in Rongchang pigs is 
associated with the KIT duplication or the KIT 
splice mutation causing white colour in 
European pigs (Xu et al., 2006; Lai et al., 
2007). It is possible that other Asian-specific 
KIT mutations underlie white colour, but other 
major candidate genes for these phenotypes 
are the microphthalmia-associated transcrip¬ 
tion factor (MITF) and endothelin receptor 
beta (EDNRB) genes. Interestingly, a recent 
screen for genetic polymorphisms in 57 por¬ 
cine homologues of genes associated with 
coat colour variation in mice revealed that 
Chinese Jinhua pigs with a characteristic 
extensive white belt (two-end black) are 
homozygous for an EDNRB missense muta¬ 
tion that is rare or absent in other breeds 
(Okumura et al., 2006, 2010). It is antici¬ 
pated that the genetic basis for white colour 
and white-spotting phenotypes present among 
Asian pig breeds will also be resolved in the 
not too distant future. 
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Introduction 

Morphological traits have not been of much 
importance in pigs. In contrast, inherited disor¬ 
ders have been of considerable importance, 
both from an agricultural point of view (e.g. 
malignant hyperthermia) and in terms of 
animal models for inherited human diseases. 
This chapter, therefore, concentrates on inher¬ 
ited disorders. However, it does also include 
mention of morphological traits that have been 
documented. 

With the molecular revolution now 
bearing fruit, and, in particular, with the devel¬ 
opment of DNA markers covering all regions 
of all pig chromosomes (see Chapters 5, 7 
and 8), knowledge of the genetic basis of 
morphological traits and inherited disorders in 
pigs will increase rapidly in the decades ahead. 
As described below, regularly updated infor¬ 
mation is available on the Internet. By this 
means, it will be possible for readers through¬ 
out the world to obtain the latest information 
on any trait or disorder mentioned in this 
review. 


©CAB International 2011. The Genetics of the Pig, 
2nd Edn (eds M.F. Rothschild and A. Ruvinsky) 


The Range of Possibilities 

The spectrum of morphological traits and 
inherited disorders ranges from those that are 
definitely due to the action of just one gene to 
those that are due to the combined action of 
many genes and many non-genetic (environ¬ 
mental) factors. In between these two extremes 
are many traits and disorders which appear to 
run in families, but for which there is insufficient 
information to enable a conclusion to be drawn 
about whether one or more genes are involved. 
Unfortunately, the literature abounds with 
examples of traits and disorders that have been 
claimed to be due to just one gene, despite the 
data being so sparse that such a claim cannot be 
justified. Similar problems exist with claims of 
inheritance being recessive or dominant; in 
most cases, there is insufficient information to 
justify the claims that have been made. In the 
fullness of time, of course, additional data might 
support the initial claims. But we must be care¬ 
ful not to jump the gun. 

During the 1980s and 1990s, there were 
considerable advances made in identifying the 
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molecular basis of inherited disorders in 
domestic animal species, and the pig was at 
the forefront of such studies. Indeed, the dis¬ 
covery of the molecular basis of malignant 
hyperthermia by Fujii et al. (1991) provides a 
marvellous, pioneering example of the power 
of molecular genetics to solve important, prac¬ 
tical and economically important problems 
(see Chapter 15). 


Previous Reviews 

Several comprehensive reviews of inherited 
traits and disorders in pigs have been pub¬ 
lished over the years. The first major summary 
was by Smith et al. (1936). Since then, there 
have been comprehensive surveys of inherited 
disorders by Johansson (1964), Ollivier and 
Sellier (1982), Hanset (1991) and Robinson 
(1991), together with more wide-ranging 
reviews by Koch et al. (1957), Wiesner and 
Wilier (1974), Huston et al. (1978) (congeni¬ 
tal disorders) and Hamori (1983). More recent 
reviews with a focus on specific types of traits 
include those of Kocwin-Podsiadla (1998), 
Fukawa and Kusuhara (2001) and Misdorp 
(2003), It should be noted that some of these 
reviews are concerned with congenital traits 
and disorders, i.e. traits and disorders that are 
present at birth. Not all such traits and disor¬ 
ders are inherited. 

In compiling the list of single-locus traits 
and disorders presented at the end of this chap¬ 
ter (Table 4.1), the information provided by 
Ollivier and Sellier (1982) and Robinson (1991) 
has been used as a convenient starting point. It 
is interesting to compare the conclusions drawn 
in these two reviews. Ollivier and Sellier dis¬ 
cussed 76 traits and disorders, and concluded 
that 29 are determined by a single locus. Nearly 
a decade later, Robinson listed 64 traits and dis¬ 
orders, of which 33 were regarded as being due 
to a single locus. Of the 64 in Robinson’s list, 
only five were first reported after the publica¬ 
tion of Ollivier and Sellier’s review. Thus, 
Robinson excluded 17 of the disorders described 
by Ollivier and Sellier. This is consistent with 
Robinson’s declaration that he had ‘been delib¬ 
erately more stringent in accepting that a defect 
could be genetically determined’. 


Among the 33 traits and disorders included 
in Robinson’s single-locus list, only three were 
first reported after the publication of Ollivier 
and Sellier’s review. Thus, despite being more 
stringent, Robinson actually declared one more 
pre-1982 traits/disorders to be due to a single 
locus than did Ollivier and Sellier. To what 
extent do the two single-locus lists coincide? 
Somewhat less than one might have thought. 
In fact, only 20 traits/disorders are common 
to the two lists: nine of Ollivier and Sellier’s 
single-locus traits/disorders were excluded 
from Robinson’s list; and ten of Robinson’s 
single-locus traits/disorders were excluded by 
Ollivier and Sellier. 

Of course, these statistics are simply a 
reflection of the difficulties involved in evaluat¬ 
ing inadequate evidence concerning inheritance, 
as described above. Both lists were compiled 
with great care by very competent geneticists. 
The fact that their conclusions differ is a cau¬ 
tionary tale for anyone charged with the task of 
determining whether a particular trait/disorder 
is due to a single gene. 


Current Sources 
of Information 

While a list of reviews is useful, it is even more 
useful to have a single catalogue of morpho¬ 
logical traits and inherited disorders that is 
regularly updated, and which is made available 
freely on the Internet. Human geneticists have 
long had access to such a resource, McKusick’s 
Online Mendelian Inheritance in Man 
(OMIM), available at http://www.ncbi.nlm.nih. 
gov/sites/entrez?db=omim. This catalogue 
contains a wealth of information on thousands 
of morphological traits and inherited disorders 
in humans. It also contains a surprising quan¬ 
tity of information on pigs, because McKusick 
has always been interested in potential animal 
models of human disorders. 

In 1978, the present author commenced 
compiling a catalogue of inherited traits and 
disorders in a wide range of animal species. 
Being modelled on, and complementary to, 
McKusick’s catalogue, this catalogue is called 
Online Mendelian Inheritance in Animals 
(OMIA) and is available at http://omia.angis. 
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org.au/. It is also available alongside OMIM at 
http://www.ncbi.nlm.nih.gov/sites/entrez? 
db=omia. 

OMIA includes entries for all inherited dis¬ 
orders in pigs, together with other traits in pigs 
for which single-locus inheritance has been 
claimed, however dubiously. Each entry consists 
of a list of references arranged chronologically, 
so as to present a convenient history of knowl¬ 
edge about each disorder or trait. For some 
entries, there is additional information on inher¬ 
itance or molecular genetics. If the disorder or 
trait has a human homologue, the relevant 
OMIM numbers are included, providing a direct 
hyperlink to the relevant entry in McKusick’s 
online catalogue OMIM. 

An Overview of Single-locus Traits 

and Disorders 

At the time of writing, OMIA contains entries 
for 222 pig traits or disorders, collectively called 
phenes. Several of them concern coat colour, 
which is discussed in Chapter 3 of this book, 
and these are, therefore, not covered in the 
present chapter. Of the 222 phenes, 72 are 
potential animal models of human disorders 
(see Chapter 17 for detailed discussion of some 
of these phenes). For conciseness, the only 
phenes listed in the present chapter (Table 4.1) 
are the 55 non-coat-colour phenes for which 


there is strong evidence of single-locus inherit¬ 
ance. For each phene listed, the table includes 
the earliest report plus a recent reference, 
extracted from OMIA. If the relevant gene has 
been identified, it is listed, together with its 
genome location, where known. More com¬ 
prehensive information, including a complete 
set of up-to-date references for each entry, 
can be obtained by accessing OMIA at http:// 
omia.angis.org.au/. 

Conclusions 

The list of inherited morphological traits and 
disorders presented in this chapter provides 
an indication of the range of such traits and 
disorders that have been observed and studied 
in pigs. The molecular and gene-mapping 
revolutions now under way will lead to an 
explosion of knowledge in this area in the 
years ahead. To exploit fully the genetic varia¬ 
tion that does occur, breeders and researchers 
need to be continually on the lookout for 
unusual animals, saving them where possible. 
If DNA can be sampled from several genera¬ 
tions of a family in which a particular mor¬ 
phological trait or disorder occurs, and if 
careful records on the occurrence of the trait 
or disorder in that family have been kept, it 
will be an increasingly straightforward matter 
to identify the gene responsible. 



Table 4.1. List of single-locus traits (other than coat-colour) and disorders documented in pigs. Further information on each entry can be obtained from OMIA 
(Online Mendelian Inheritance in Animals; http://omia.angis.org.au/). 


Name of phene 


Location of gene 
SSC (Sus scrofa) chromo¬ 
some number: position in 

OMIA Gene Sscrofa9 (pig genome) assembly 
number (if known) (from http://www.ensembl.org) Description 


Aplasia of tongue 000055 


Arthrogryposis 000069 SSC5 

multiplex 
congenita (AMC) 


Ataxia, progressive 001091 


Blood group 001089 ABO SSC6 

system A 

Blood group 000120 

system B 


Congenital absence of the median portion of the apex of the 
tongue. Nes (1958) presented data from several matings 
supporting single-locus autosomal recessive inheritance 

Persistent flexion of a joint. Also known as bent-stiff-legged, 
bentleg and congenital articular rigidity (CAR). Can be caused 
by various non-genetic factors such as ingestion of 
Jimsonweed or tobacco by pregnant sows. However, there is 
convincing evidence of a single-locus autosomal recessive 
form of this disorder (Lomo, 1985), and the disorder has now 
been mapped to chromosome SSC5 (Genini etal., 2006) 

A progressive failure of muscle coordination, resulting in perverse 
movements. Also known as congenital motor defect or congenital 
ataxia. The central nervous system appears normal at birth, but 
older pigs show dysplasia of the cerebellar cortex. Rimaila- 
Parnanen (1982) provided convincing evidence of autosomal 
recessive inheritance 

This system has at least two alleles. Also known as the A-0 
blood group system. Mariani et al. (1996) mapped the locus to 
SSC6 

This system has at least two alleles 


Blood group 000121 

system C 


This system has at least two alleles 


Blood group 000122 

system D 


This system has at least two alleles 


Earliest and most 
recent reference 


Nes (1958) 


Hallqvist (1933); 
Genini et al. (2006) 


Rimaila-Parnanen 
(1982); Genini et al. 
(2007) 


Sprague (1958); 
Mariani et al. 

(1996) 

Baker and Andresen 
(1964); Erhard et 
al. (1988) 

Andresen and Baker 
(1964); Rasmusen 
(1982) 

Andresen (1962); 
Erhard et al. (1988) 





Blood group 
system E 


000123 


Blood group 000124 

system F 

Blood group 000127 

system G 

Blood group 000128 SSC6 

system H 

Blood group 000129 

system I 

Blood group 000130 

system J 

Blood group 000131 

system K 

Blood group 000132 

system L 

Blood group 000133 

system M 

Blood group 000134 

system N 

Blood group 001250 

system O 


Dermatosis 

vegetans 


000271 


This system has at least 15 alleles 


This system has at least three alleles 

This system has at least three alleles 

This system has at least seven alleles. The locus is linked to 
the gene for malignant hyperthermia, forming part of what 
is commonly called the Hal linkage group 

This system has at least two alleles 

This system has at least three alleles 

This system has at least six alleles 

This system has at least six alleles 

This system has at least 18 alleles 

This system has at least three alleles 

This system has at least two alleles 

A well-characterized syndrome comprising skin lesions on the 
body, swollen feet (club foot), and multinucleate giant cells 
(MGC) in the lungs associated with fatal pneumonia. Occurs 
only in the Landrace breed. Breeding data support single-locus 
autosomal recessive inheritance (Flatla etal., 1961). Evensen 
(1993) showed that the giant cells in the lungs of affected pigs 
are derived from mesenchymal cells, with a monocyte/ 
macrophage origin 


Andresen and Irwin 
(1959a); Hojny and 
Nielsen (1992) 

Andresen (1957); 

Erhard etal. (1988) 
Andresen (1957); 

Erhard et al. (1988) 
Andresen (1957); 
Zeveren et al. 
(1988) 

Andresen (1957); 

Andresen (1966) 
Andresen (1957); 
Hradecky et al. 
(1985) 

Andresen and Irwin 
(1959b); Erhard 
et al. (1988) 
Andresen (1962); 
Marklund et al. 
(1993) 

Nielsen (1961); 

Nielsen (1991) 

Hala and Hojny 
(1964) 

Hojny and Hala 
(1965); Mariani 
et al. (1996) 

Hjarre (1953); 
Evensen (1993) 
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Table 4.1. Continued. 


Name of phene 
Dwarfism 


Epitheliogenesis 

imperfecta 


Ganqliosidosis, 

GM2 


Heterochromia 

iridis 


Location of gene 
SSC (Sus scrota) chromo¬ 
some number: position in 

OMIA Gene Sscrofa9 (pig genome) assembly Earliest and most 

number (if known) (from http://www.ensembl.org) Description recent reference 


000299 COL10A1 SSC 1: 85743574- 

85750572 


000348 


000403 


000468 


There have been several isolated reports of dwarfism in pigs, 
with the most convincing single-locus evidence being 
provided by Jensen et al. (1984). The two reports with the 
greatest clinical detail are those by Kaman et al. (1991) and 
Shirota et al. (1995). Nielsen et al. (2000) identified a 
missense mutation in the COL10A1 gene in a form of 
dwarfism in pigs that is analogous to Schmid metaphyseal 
chondrodysplasia (SMCD) in humans 

Congenital absence of areas of skin. Also known as aplasia 
cutis. Affected animals usually die within 3 days, but some 
survive to adulthood. The chance of survival is indirectly 
proportional to the area of skin that is absent. A single-locus 
autosomal recessive disorder 

A lysosomal storage disease in which there is a build-up 
(storage) of GM2 gangliosides (a type of glycolipid) in 
various tissues, due to the lack of the enzyme hexosamini¬ 
dase, whose task is to break down the GM2 ganglioside 
into its constituents. Characterized by progressive 
neuromuscular dysfunction and impaired growth from an 
early age 

Difference in colour of the iris in the two eyes, or in different 
areas of one iris. In pigs, part (partial heterochromia) or all 
(complete heterochromia) of the iris lacks pigment. The latter 
form is called glass-eye. Reconciling data from Durr (1937) 
and Gelati et al. (1973), Ollivierand Sellier (1982) concluded 
that bilateral complete heterochromia iridis is autosomal 
recessive, and that heterozygotes show unilateral and partial 
heterochromia 


Petrov (1974); 
Nielsen et al. 
( 2000 ) 


Nordby (1929); 
Benoit-Biancamano 
et al. (2006) 


Read and Bridges 
(1968); Kosanke 
et al. (1978) 


Durr (1937); Gelati 
et al. (1973) 





000472 


Hind limb 
paralysis 


Hypercholes- 000499 LDLR SSC2:52433429- 
terolaemia, 52437209 

spontaneous 


Hypotrichosis, 

dominant 


001278 


Evidence for autosomal recessive inheritance of hind limb Berge (1941); 

paralysis was presented by Berge (1941). Ollivier and Sellier Ludvigsen et al. 

(1982) cited a study by Ludvigsen et al. (1963) as providing (1963) 

evidence for autosomal recessive inheritance of paralysis of the 
hind limbs, associated with abnormal lumbar vertebrae. 

Because the Berge disorder did not involve vertebral abnormali¬ 
ties, it is possible that more than one locus can give rise to hind 
limb paralysis 

An excess of cholesterol in the blood. Also called cholesterolaemia. Lee et al. (1990); 

A strain of pigs that shows spontaneous development of Pena et al. (2009) 

hypercholesterolaemia has a mutation in the gene for 

apolipoprotein B ( apoB ), designated Lpb5. This mutant apoB 

allele is associated with low-density lipoprotein (LDL) particles 

that are deficient in their ability to bind to the LDL receptor (Purtell 

et al., 1993), and that are therefore cleared from the circulation 

more slowly. The resultant hypercholesterolaemia, however, is 

not a single-locus trait. Indeed, there appears to be at least 

one other gene of large effect involved in this disorder (Aiello 

etal., 1994). A genome scan conducted by Hasler-Rapacz etal. 

(1998) showed that the gene for this disorder in pigs maps 
near to the centromere of chromosome 2, which is homolo¬ 
gous to the region of human chromosome 19 containing the 
gene for the low-density lipoprotein receptor (LDLR), a strong 
candidate for involvement in this disorder. Sequence analysis 
of the LDLR gene from homozygous normal and affected pigs 
showed that the disorder is due to a single missense mutation 
(resulting in the amino-acid substitution Arg84Cys) 


An autosomal dominant form of hairlessness (or hypotrichosis). Meyer and Drommer 
The listing here of two separate loci for hypotrichosis is based (1968) 
on precedent (Ollivier and Sellier, 1982; Robinson, 1991) and 
on supposition of the present author. Breeding trials to test 
non-allelism have not been reported 
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Table 4.1. Continued. 


Location of gene 
SSC (Sus scrota) chromo¬ 
some number: position in 

OMIA Gene Sscrofa9 (pig genome) assembly Earliest and most 

Name of phene number (if known) (from http://www.ensembl.org) Description recent reference 


Hypotrichosis, 001458 SSC15 

juvenile with 
age-dependent 
emphysema 


Hypotrichosis, 001279 

recessive 

Legless 00587 


Lymphosarcoma 000615 


This disorder is characterized by hairlessness until puberty, and 
localized emphysema in the lungs following puberty (Bruun 
et a/., 2008). Similarity of the phenotype with an integrin beta 
6 knockout mouse disorder led Bruun et al. (2008) to show 
that this disorder in pigs maps to the location of this gene 
(ITGB6) in pigs, i.e. SSC15. However, no lesion in the porcine 
version of this gene or in the gene for the other subunit of 
integrin beta 6 (ITGAV) has yet been found 

An autosomal recessive form of hairlessness (or hypotrichosis) 

The absence of all four legs. Also called streamlined. Johnson 
and Lush (1939) and Johnson (1940) provided convincing 
evidence of autosomal recessive inheritance of this lethal 
disorder 


Bruun et al. (2008) 


Roberts and Carroll 
(1931) 

Johnson and Lush 
(1939); Johnson 
(1940) 


A malignant neoplastic disorder of lymphoid tissue. Also called McTaggart et al. 
lymphoma. Sometimes also called leukaemia, although, (1971); Hejazi and 

strictly speaking, this is a different disorder (namely, a Danyluk (2005) 

malignant disorder of the blood-forming tissue). Clinical 
signs of lymphosarcoma in pigs include stunted growth, pot 
belly, enlargement of superficial lymph nodes, an increase in 
circulating lymphocytes, and death before 15 months of age. 

Post-mortem reveals sarcoma involving all lymph nodes, but 
primarily those draining the gut and lung (Head et al., 1974). 

McTaggart et al. (1979) provided convincing evidence that 
this disorder is autosomal recessive in pigs 
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Malignant 

hyperthermia 


Meat quality 


000621 RYR1 SSC6:32706687-32735141 


001085 PRKAG3 SSC15q21-22 


A progressive increase in body temperature, muscle rigidity and Briskey (1964); 
metabolic acidosis, leading to rapid death. In pigs, malignant Laville et al. 
hyperthermia (MH) leads to rapid post-mortem changes in (2009) 

muscle, resulting in pale soft exudative (PSE) meat. MH can 
be triggered by a minor stress, such as loading, transport, 
sexual intercourse, high ambient temperature, or exposure to 
the anaesthetic halothane. Susceptibility to halothane-induced 
MH is an autosomal recessive trait in pigs. Together, sudden 
death and PSE constitute porcine stress syndrome (PSS), 
which became a major economic problem in many countries in 
the 1970s. In part, this was due to strong selection for 
increased leanness, which is associated with susceptibility to 
PSS. In 1991, a major breakthrough occurred when a 
Canadian research team led by David MacLennan (Fujii et al., 

1991) showed that MH is due to a base substitution (C^T) in 
the 1843rd nucleotide of the gene for the calcium release 
channel in the sarcoplasmic reticulum of skeletal muscle. The 
base substitution causes an amino-acid substitution 
(arginine^cysteine) in the 615th position of the calcium- 
release channel, resulting in altered calcium flow. A PCR 
genotyping test based on the causative mutation has been 
used extensively in many countries, leading to the elimination 
of the mutation from many herds. See Chapter 16 


The RN (Rendement Napole) mutation was first documented by Leroy et al. (1990); 
Leroy et al. (1990). In longissimus dorsi muscles, carriers of Jennen et al. 
this dominant gene show lower pH, higher surface and (2007) 

internal reflectance values, lower protein extractability, lower 
water-holding capacity, lower Napole yield (yield after curing 
and cooking) and greater cooking loss. On the positive side, 
carriers have a lower shear force value, a stronger taste and 
smell and greater acidity. The primary cause of these 
differences is that the mutant allele results in higher stored 
glycogen content in muscle. The gene is located on the long 
arm (q21—22) of chromosome 15 
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Table 4.1. Continued. 


Name of phene 


Location of gene 
SSC (Sus scrota) chromo¬ 
some number: position in 

OMIA Gene Sscrofa9 (pig genome) assembly 
number (if known) (from http://www.ensembl.org) Description 


Membranopro- 000636 CFH 
liferative glomeru¬ 
lonephritis type II 


Motor neuron 000662 

disease, lower 


A progressive inflammation of the capillary loops in the 
glomeruli of the kidney in which the glomeruli become 
enlarged as a result of the proliferation of mesangial cells and 
irregular thickening of the capillary walls, due to massive 
glomerular deposits of complement component C3 and the 
terminal C5b-9 complement complex. This disorder is due to 
a deficiency of complement factor H, whose task is to restrict 
the formation of C3 convertase. The hypermetabolism of the 
excess C3 results in the deposits of complement in the 
kidney, and in hypocomplementaemia.The disorder is 
autosomal recessive in pigs. Affected piglets die at around 5 
weeks 

A distinctive locomotor disorder of weaners, characterized by 
progressive ataxia and paresis of variable severity. 
Histological examination reveals significant degenerate 
changes in lower motor neurons, lower (motor) spinal nerve 
roots, and myelinated axons of peripheral nerves and of 
ventral and lateral spinal columns (O’Toole et al., 1994b). 
Lipid-like inclusions and mitochondrial swelling suggest an 
underlying defect in lipid metabolism and/or mitochondrial 
metabolism (O’Toole et al., 1994b). Limited breeding data 
are strongly suggestive of autosomal dominant inheritance, 
and a breeding colony has been established at the 
Wyoming State Veterinary Laboratory (O’Toole et al., 

1994a) 


Earliest and most 
recent reference 


Jansen (1993); 
Hegasy et al. 
( 2002 ) 


Wells etal. (1987); 
O’Toole et al. 
(1994b) 


F.W. Nicholas 




Neonatal diarrhoea, 001088 SSC13 

F4 (previously 
K88) 


Nucleoside trans- 001236 
port defect 


Oedema 000493 


Neonatal diarrhoea in piglets is often caused by strains of Gibbons et al. (1977); 

Escherichia coli bacteria that have a cell-surface antigen called Joller et al. (2009) 
K88 (renamed F4), which combines with a glycoprotein 
receptor on the wall of a piglet’s intestine, enabling the bacteria 
to attach themselves to the intestine. The receptor is a type of 
transferrin. Once attached, the bacteria proliferate, releasing 
enterotoxins and thus producing diarrhoea, which can lead to 
high mortality. Certain piglets lack the intestinal receptor to F4, 
and are therefore resistant to F4 bacteria and hence to 
diarrhoea caused by F4 strains. There are several different 
antigenic variants of F4 (ab, ac, ad), and it seems that there is a 
separate receptor for each. The presence or absence of at least 
two of these receptors (for F4ab and F4ac) is determined for 
each receptor by one of two closely linked genes on chromo¬ 
some 13. The immunological and population-genetics implica¬ 
tions of segregation at these loci are very interesting: the result 
is selection against heterozygotes, as explained by Nicholas 
(2010, pp. 132-133). From a three-generation linkage analysis, 

Python et al. (2002) concluded that the receptor for F4ac is the 
same as the receptor for F4ab, and that this single receptor is 
encoded by a gene they have called F4bcR, which they have 
more finely mapped to SSC13 

Defect in the transport of nucleosides (purine or pyrimidine base Aran and Plagemann 

attached to a ribose or deoxyribose sugar) across erythrocyte (1992) 
membranes. This disorder has not been recorded in pigs in vivo. 

However, in a kidney cell line, Aran and Plagemann (1992) 
created several different mutants that resulted in failure to 
transport thymidine and uridine. The faulty gene has not yet 
been identified 


Abnormal accumulation of fluid in tissues and/or body Young (1952); 

cavities. Also called myxoedema, dropsy or hydrops. In Neeteson (1964) 

reviewing the evidence summarized by Koch et al. (1957), 

Ollivier and Sellier (1982) concluded that oedema is an 

autosomal recessive disorder, possibly to do with a thyroid 

defect ^ 
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Table 4.1. Continued. 


Name of phene 


Location of gene 
SSC (Sus scrofa) chromo¬ 
some number: position in 

OMIA Gene Sscrofa9 (pig genome) assembly 
number (if known) (from http://www.ensembl.org) Description 


Polydactyly with 000811 
otocephalic 
monster 


Porphyria 000815 


Progressive 000829 

myopathy 


In village pigs of Papua New Guinea, Malynicz (1982) reported 
an autosomal dominant disorder in which heterozygotes are 
polydactylous, and homozygotes are ‘monsters’, having club 
foot (dactylomegaly, or abnormally large digits) and otocephaly 
(absence of the lower jaw, with the ears united below the face) 

Porphyria is a general term for disorders resulting from a 
deficiency of any one of the six enzymes involved in the 
biosynthesis of protoporphyrin from aminolaevulinic acid. 
Some of the intermediates (loosely called porphyrins) are 
extremely photoreactive. Because a deficiency of any one of 
these enzymes results in a build-up of intermediates, 
photosensitivity is a common clinical sign in most species, but 
not in pigs. In this species, clinical diagnosis is based on 
discoloration (pink to brown) of the teeth. On autopsy, bones, 
kidneys and lymph nodes also show a brown discoloration. 
Another major clinical sign is haemolytic anaemia, due to a 
deficiency of haemoglobin, of which protoporphyrin is a vital 
component. There have been only isolated reports in pigs, 
with only one study describing the likely enzyme deficiency: 
Roels et al. (1995) attributed their cases of porphyria to a 
combined deficiency of the third (uroporphyrinogen-lll-cosyn- 
thetase) and fourth (uroporphyrinogen decarboxylase) of the 
six enzymes involved in protoporphyrin biosynthesis. Their 
cases would therefore be a ‘mixed’ porphyria, comprising 
aspects of congenital erythropoietic porphyria and porphyria 
cutaneous tarda 

A progressive degeneration of the hind legs, resulting in 
collapse. Also known as creeper syndrome. Wells et al. (1980) 
provided evidence for autosomal recessive inheritance 


Earliest and most 
recent reference 


Malynicz (1982) 


Clare and Stephens 
(1944); Roels et al. 
(1995) 


Wells and Bradley 
(1978); Wells et al. 
(1980) 




Protamine-2 

deficiency 


000834 


Renal cysts 001257 


Resistance to 000862 FUT1 SSC6:37631354- 
oedema disease 37632621 


In most mammals, protamine-2 constitutes the major compo- Maier et at. (1990) 
nent of basic protein in sperm nuclei. It binds to DNA during 
the elongation of spermatids. Pigs and cattle have the 
protamine-2 gene, but produce very little protamine-2. In the 
case of pigs, this is due to a deletion of 27 bases in the 
middle of the gene. It appears that all pigs are homozygous 
for this mutation. Presumably, the mutation occurred before 
the splitting of the porcine evolutionary lineage 

The occurrence in the kidney of closed epithelium-lined sacs Wijeratne and Wells 
containing a liquid or semi-solid substance. Wijeratne and (1980) 

Wells (1980) reported breeding data that provided convincing 
evidence of autosomal dominant inheritance 

Not to be confused with oedema (see separate entry), oedema Bertschinger et al. 
disease is a fatal infectious disease of weaners and growers, (1993); Coddens 
characterized by lack of coordination, a hoarseness of voice, et al. (2008) 
weakness, flaccid paralysis and blindness. Although oedema of 
the eyelids, face and ears is diagnostic, this trait is seldom 
visible during clinical examination. Also called gut oedema or 
bowel oedema. Caused by opportunistic colonization by certain 
strains of E. coli. Bertschinger et al. (1993) provided substantial 
evidence from breeding experiments for single-locus autosomal 
recessive inheritance of resistance to experimental challenge 
with a causative strain of E. coli. The gene is called ECF18R 
(for E. coli receptor FI 8), with alleles B for susceptibility and b 
for resistance. Vogeli et al. (1996) mapped ECF18R to the 
middle of the Hal linkage group on chromosome 6. Noting that 
one of the genes in this linkage group is the S gene (for 
suppression of blood group system A), and that the likely human 
homologue of S is blood group system l-l (whose locus encodes 
the enzyme alpha-fucosyltransferase, FUT), Meijerink etal. 

(1997) isolated porcine homologues of FUT, and showed that 
they mapped to the same location as the S gene, i.e. the 
recombination fraction between FUT and S is zero. By sequenc¬ 
ing one of the FUT genes (FUT1) in resistant and susceptible 
pigs, these same workers then identified two base substitutions 
that are in strong linkage disequilibrium with resistance/ 
susceptibility. In the short term, these polymorphisms provide 
valuable markers for resistance. In the not-too-distant future, we co 

will know whether FUT is actually the ECF18R gene 

Continued 
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Table 4.1. Continued. 


Location of gene 
SSC ( Sus scrofa) chromo¬ 
some number: position in 

OMIA Gene Sscrofa9 (pig genome) assembly Earliest and most 

Name of phene number (if known) (from http://www.ensembl.org) Description recent reference 


Respiratory distress 000101 
syndrome 


Sex reversal: XX 000901 
male 


Sperm, short tail 001334 KPL2 SSC16:18508929- 

18759937 


Syndactyly 000963 


Thrombopathia 001003 


Also known as barker syndrome, because of the resemblance 
between affected piglets and barker foals (Gibson et al., 
1976). Characterized by very small thyroid glands, hairless¬ 
ness, retarded ossification, delayed haemopoiesis and death 
soon after birth due to acute respiratory distress (Wrathall, 
1976). Breeding data presented by Wrathall (1976) provide 
convincing evidence of autosomal recessive inheritance 

Switoriski et al. (2002) provided independent confirmation of the 
autosomal recessive inheritance of SRY-negative karyotypi- 
cally XX pigs with varying degrees of phenotypic maleness 

Known as the immotile short-tail sperm (ISTS) defect, this 
disorder is due to an insertion of a retrotransposon in the 
KPL2 gene (Sironen etal., 2006) 

Fusion of the digits. Convincing evidence for autosomal 
dominant inheritance was provided by Simpson and Simpson 
(1908), soon after the rediscovery of Mendelism. Breeding 
data reported by Detlefsen and Carmichael (1921) confirmed 
this conclusion. There have been no recent reports of 
breeding data for this disorder 

A blood coagulation disorder due to failure of ADP release from 
platelets following stimulation by aggregation factors such as 
thromboplastin. Also called storage pool deficiency. 
Characterized by mild-to-moderate bleeding. Shown to be 
autosomal recessive by Thiele et al. (1986) 


Gibson et al. (1976); 
Wrathall (1976); 
Wrathall et al. 
(1977) 


Pailhoux et al. (1994); 
Switoriski et al. 
( 2002 ) 

Andersson et al. 
(2000); Kopp et al. 
(2008) 

Auld (1889); Leipold 
and Dennis (1972) 


Daniels et al. (1986); 
Thiele et al. (1986) 
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Tremor type A III, 000770 
congenital 
(X-linked) 


Tremor type A IV, 001020 
congenital 


Tremor, high- 001200 SSC7q 

frequency 


There are several different forms of congenital tremor syn- Harding et al. (1973); 

drome, also known as dancing pig disease, trembles or Baumgartner and 

myoclonia congenita. The common clinical feature comprises Brenig (1996) 
rhythmic tremors of the head and limbs, being most severe 
when standing and being absent when sleeping. Except when 
provoked by external stimulus, e.g. handling or sudden noise, 
tremors are rarely seen after the first few months of life and 
usually disappear by the second or third week. Affected pigs 
have decreased life expectancy due to decreased ability to 
nurse and to avoid being crushed by the sow. Done and 
Harding (1967) distinguished between those forms that show 
lesions in the central nervous system (type A) and those that 
do not (type B). Within type A, there are two forms caused by 
infectious agents (I and II) and two inherited forms (III and IV) 

(Done, 1968). Congenital tremor type III A, also known as 
cerebrospinal hypomyelinogenesis, is an X-linked recessive 
disorder 


Background details are given in the previous entry. Congenital 
tremor type A IV, also known as cerebrospinal dysmyelino- 
genesis, is an autosomal recessive disorder 

This disorder was described by Richter etal. (1995), from whom 
the following description is taken. The disorder is characterized 
by muscular weakness and a very intense tremor of the legs 
when standing and walking but not when at rest in a lying 
position. The intensity of tremor and muscular weakness 
progressively increases with age, resulting in pronounced 
postural instability. Also known as the Campus syndrome, after 
the boar whose offspring first showed the disorder. Breeding 
data show autosomal dominant inheritance. The disorder maps 
to SSC7q (Tammen etal., 1999) 


Patterson et al. 
(1973); Blakemore 
and Harding 
(1974) 

Richter et al. (1995); 
Tammen et al. 
(1999) 
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Table 4.1. Continued. 


Name of phene 


Location of gene 
SSC (Sus scrofa) chromo¬ 
some number: position in 

OMIA Gene Sscrofa9 (pig genome) assembly 
number (if known) (from http://www.ensembl.org) Description 


Vitamin 000837 CYP27B1 

D-deficiency 
rickets, type I 


Vitamin D-deficiency rickets, type I. Vitamin D (cholecalciferol) 
is synthesized in the skin from 7-dehydrocholesterol by the 
action of UV radiation from sunlight. Cholecalciferol, 
however, has very little biological activity: it requires two 
hydroxylations in order to become (biologically) active. The 
first hydroxylation, catalysed by cholecalciferol 25-hydroxyl¬ 
ase, occurs in the liver. The second of these hydroxylations 
occurs in the kidney under the action of the enzyme 
25-alpha-hydroxycholecalciferol 1-hydroxylase. The resultant 
active form of vitamin D (called 1,25-dihydroxycholecalciferol 
or 1,25(OH)sub2D) is a steroid hormone that plays a vital 
role in whole-body calcium homeostasis. Vitamin 
D-deficiency rickets, type I (previously known as pseudo¬ 
vitamin D deficiency rickets) is an inherited deficiency of the 
1-hydroxylase enzyme. As expected, this deficiency results 
in clinical signs indistinguishable from those seen in 
individuals suffering from non-genetic lack of vitamin D, 
most commonly resulting from a dietary deficiency of 
calcium or insufficient exposure to sunlight. The clinical 
signs of rickets (inherited and non-genetic) arise from 
defects in calcium homeostasis. The most 
noticeable effects include a failure of calcification of bones 
(leading to bowing of limbs) and delayed dentition. The 
molecular basis of this disorder in pigs was determined by 
Chavez et al. (2003), who showed that in Hannover pigs the 
clinical signs result from either of two deletions in the gene for 
cytochromome P450C1 (otherwise known as CYP27B1) 


Earliest and most 
recent reference 


Meyer and Plonait 
(1968); Chavez 
et al. (2003) 


F.W. Nicholas 




von Willebrand 001056 
disease 


Wattles 001061 


Woolly hair 


001256 




von Willebrand factor (vWF) is a multimeric form of a plasma 
protein encoded by an autosomal gene (not yet mapped in 
pigs). vWF plays a vital role in platelet adhesion and clot 
formation. It also combines with factor VIIIC (the product of 
the X-linked haemophilia A locus), forming factor VIII. vWF 
accounts for 99% of the mass of factor VIII; its role is to 
protect factor VIIIC from degradation, von Willebrand disease 
(also called pseudohaemophilia or vascular haemophilia) is 
an autosomal bleeding disorder resulting from deficient or 
defective vWF. The molecular basis of this disorder in pigs 
has not yet been determined 

Appendages suspended from the head. Also known as tassels 
or bells, they are fleshy masses of cartilaginous material 
covered with normal skin and suspended from the mandibular 
area. Roberts and Morrill (1944) provided good evidence for 
single-locus autosomal recessive inheritance 

Curly coat, as seen in the Canastrao breed. An autosomal 
dominant trait, which may be allelic with recessive 
hypotrichosis 


G> 


Flogan et al. (1941); 
Sauger et al. 
(2005) 


Kronacher (1924); 
Roberts and Morrill 
(1944) 


Rhoad (1934) 
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Introduction However, there have also been enormous 

conceptual developments in our understanding 
There has been enormous progress in the of mammalian genomes in that same time 

molecular characterization of the genome of period that reinforce the need for continued 

the pig, and indeed many other mammalian extrapolation or inference of properties of the 

species, in the 12 years since the publication of pig genome from other mammals, such as 

the first edition of The Genetics of the Pig. humans or the mouse. While it is reasonable to 

A partial genome sequence has been available for assume that the pig has a typical mammalian 

a number of years (Wernersson et ah, 2005); genomic structure and molecular biology, given 

a complete genome sequence is imminent (see the universality of some features across all 

Chapter 8); and technical developments in mammals studied, and also given the concord¬ 
sequencing methodology, so-called NextGen ance with other mammals in those features 

sequencing, are set to further revolutionize specifically examined in the pig, this review will 

discovery and characterization of genetic varia- cite studies specific to the pig wherever possi- 

tion in the domestic pig and its close relatives. ble and will extrapolate only where necessary. 


©CAB International 2011. The Genetics of the Pig, 
2nd Edn (eds M.F. Rothschild and A. Ruvinsky) 
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Perhaps the most profound conceptual 
leap arises from the realization that there is at 
least an order of magnitude more transcripts 
than the approximately 20,000 protein coding 
genes in the genome of mammals, and that the 
transcription of non-protein- (or non-) coding 
RNAs (ncRNAs) is pervasive. Indeed, a sub¬ 
stantial proportion of recognized transcripts 
are for ncRNA, many of which are initiated 
from the 3' UTRs (three prime untranslated 
mRNA regions) of protein-coding loci (The 
FANTOM Consortium et a/., 2005). The rec¬ 
ognition that the majority of the mammalian 
genome is transcribed, much of it from both 
DNA strands (The FANTOM Consortium et a/., 
2005), profoundly affects our understanding 
and interpretation of genome sequences of 
even the best characterized mammals, let alone 
the less intensively studied pig. It is now very 
clear that information in the genome does not 
lie only in short collinear, mainly protein¬ 
encoding exonic sequences comprising 2% of 
the genome and separated by 98% of long, 
informationally devoid deserts. Instead, it 
appears that information is encoded in a multi¬ 
functional, overlapping, even bidirectional 
fashion, dispersed across the genome and 
described by Kapranov et ai (2007) as an 
interleaved model. 

One important component of the ncRNA 
spectrum of the pig genome not even men¬ 
tioned in the first edition of this chapter, 
namely microRNA (miRNA), is now partially 
characterized in pigs and can be compared 
with similar sequences in other mammals. 
Because miRNAs have important regulatory 
and developmental functions and are amen¬ 
able to manipulation to control gene expres¬ 
sion and even to attack pathogens as short 
interfering (or short inhibitory) RNA (siRNA), 
they will receive commensurate weight in this 
review. 

Since 1998, porcine endogenous retro¬ 
viruses (PERVs), another component of the pig 
genome, have received a level of investigation 
and original paper publication entirely dispro¬ 
portionate to their representation in the genome 
because of concerns that they might represent 
a zoonotic hazard to xenotransplantation of 
porcine tissues into humans. These also will be 
reviewed in detail. 


Nuclear Genome Size, 'Gene Number’ 
and Developmental Complexity 

For many years, the C-value paradox 
(Hinegardner, 1968; Britten and Davidson, 
1971; Thomas, 1971) described the lack of 
correlation of genome sizes in eukaryotic 
organisms with their developmental complex¬ 
ity. In vertebrates, at one extreme, the puffer 
fish, Fugu rubripes , and related tetraodontoid 
fish have particularly small haploid genomes 
of only 4 x 10 8 bp, the smallest recorded for 
any vertebrate (Brenner et a/., 1993). The 
Fugu genome, which is 7.5 times smaller than 
a mammalian genome, contains less than 10% 
repetitive DNA. At the other extreme, the 
genome of the lungfish, Protopterus aethiopi- 
cus, is 350 times larger than that of the puffer 
fish at 1.4 x 10 11 bp. Although many of the 
more extreme genome sizes, such as that of 
the lungfish, are now thought to be artefacts 
of polyploidy, the C-value paradox focused 
attention on the fact that much of the genome 
of complex organisms, especially the non¬ 
coding and often repetitive fraction that com¬ 
prises 98% or more in many organisms, had 
no known function, and indeed was often dis¬ 
missed as junk DNA owing to the fact that it 
could vary so substantially in amount between 
species. If it was doing anything, it was not 
obvious. Although the genome sizes of mam¬ 
mals, by comparison, are relatively uniform, 
they still contain a very large proportion of 
non-coding and repetitive sequences. 

If total genomic DNA content does not 
explain differences in developmental complex¬ 
ity, perhaps the answer might lie in the number 
of protein-coding sequences. Before the com¬ 
pletion of the human genome sequence in 
2002, there was much speculation about the 
number of protein-encoding genes in the 
genome (e.g. Fields et al. , 1994), with initial 
estimates from about 50,000 to 150,000. The 
now generally accepted and, to some biologists, 
disappointingly low figure of about 20,000 for 
humans and other mammals has created a new 
paradox, termed the G-value paradox by Hahn 
and Wray (2002). Not only do mammals have 
far fewer protein-coding genes than was origi¬ 
nally thought, the notably simpler nematode 
worm, Caenorhabditis elegans, has almost as 
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many at 19,300 (Stein et a/., 2003), and even 
protozoa such as Tetrahymena thermophila 
have many more protein-coding genes (about 
27,000) than mammals (Eisen et a/., 2006). 
Clearly, simple counts of protein-encoding 
genes do not scale with increasing developmen¬ 
tal complexity. Although alternative splicing 
does provide an enormous increase in the 
number of protein-encoding transcripts and the 
complexity of their regulation, which partly 
relieves the G-value paradox, the discovery of 
the functional importance of many ncRNAs and 
pervasive, virtually genome-wide, transcription 
have led to changes in how we should even think 
about question of genome size and organismal 
complexity. Crude comparisons of genome size 
are not meaningful; nor are crude comparisons 
of counts of protein-coding sequences. However, 
a more inclusive definition of the word ‘gene’ to 
include the many newly discovered classes of 
non-coding but regulatory RNAs, as well as a 
recognition of the interleaved rather than 
collinear pattern of encoding information in 
DNA, will not only further contribute to relieving 
the C-value and G-value paradoxes, but also help 
to explain the 98% or more of the mammalian 
genome that has previously had no clearly 
assigned function. 

So how do the size and other properties 
of the nuclear genome of the pig match up 
with the genomes of other mammals? The esti¬ 
mated genome size, based on flow karyotyping 
(Schmitz et a/., 1992) is 2.72 x 10 9 bp for an 
X-bearing haploid nucleus and 2.62 x 10 9 bp 
for a Y-bearing haploid nucleus. John and 
Miklos (1988) have reported an approximate 
twofold range in DNA content in mammals 
from 2.1 x 10 9 bp in the muntjac deer, 
Muntiacus muntjak, to 5 x 10 9 bp in the 
aardvark, Orycteropus afer. This places the 
pig in the low-to-mid range of mammalian 
genome sizes, although differences in meas¬ 
urement methodology mean that some of these 
estimates must be interpreted cautiously. 
Wernersson et al. (2005) have made detailed 
comparisons of their partial porcine genome 
sequence with human and mouse sequences 
and have shown that for all classes of sequence 
comparison - exonic, 5' and 3' UTR, intronic, 
intergenic and miRNA - the pig sequence is 
more similar to the human sequence, despite a 


similar divergence time of artiodactyls, primates 
and rodents from a common ancestor. 


Repetitive DNA, GC Content 
and Isochores 

Repetitive sequences in the genomes of many 
species were first recognized on the basis of 
reassociation kinetics over 40 years ago. When 
genomic DNA was made single stranded by 
heating in solution, it was recognized that some 
components of the genome were able to 
reassociate into double-stranded DNA very 
rapidly when the solution cooled, implying that 
they were present at very high copy numbers in 
the genome - sometimes millions of copies. On 
this basis, it was estimated that 40-60% of the 
genome of mammals comprised repeated 
sequences. Repetitive DNA can be broadly clas¬ 
sified into two categories, namely tandemly 
repeated and dispersed repeated DNA. 
Tandemly repeated DNA ranges from highly 
repeated sequences, which often consist of 
small repeat units, to less highly repeated 
sequences, which may sometimes be functional, 
for example the ribosomal RNA genes. In the 
dispersed repeated category are transposable 
elements, some of which retain the capability of 
further movement, and pseudogenes. 

Caesium chloride density-gradient ultra¬ 
centrifugation was routinely used in the past 
for isolating DNA based on density. As the 
base composition of highly repeated DNA can 
differ substantially from the overall composi¬ 
tion of the genome, many repeated fractions 
could be recognized and isolated on the basis 
of their different densities, occurring most 
obviously as distinct satellite bands in the CsCl 
gradients. More subtly, Bernardi et al. (1985) 
used CsCl gradient ultracentrifugation to parti¬ 
tion the nuclear DNA of warm-blooded verte¬ 
brates into up to six density classes (LI to L3, 
HI to H3) determined by increasing GC 
content, which they termed isochores. Protein¬ 
coding sequences are found in the heaviest 
isochores, with about 50% of human protein¬ 
coding sequences found in H2 and H3, which 
comprise only 8% and 4%, respectively, of the 
genome. The clustering of coding genes into 
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GC rich isochores correlates with the clustering 
of protein-coding sequences into cytogeneti¬ 
cally observable R bands, which are GC rich. 

The spectrum of sequences contributing 
to the Wernersson et al. (2005) incomplete 
genome sequence for the pig comprised 
approximately 34% of repetitive sequences, of 
which 11.3% were SINE (short interspersed 
nuclear element) sequences, 16.14% LINEs 
(long interspersed nuclear elements), 2.8% 
LTR (long terminal repeat) elements (including 
endogenous retroviruses), 0.9% DNA trans- 
posons, 1.47% satellites, 0.62% simple repeats 
and 0.53% low complexity repeats. 

The mean GC contents of the pig sequences 
(Wernersson et a/., 2005) were found to be 
40.7% for introns, 49.6% for coding, 41.8% 
for 3' UTR, 59.2% for 5' UTR and 39.6% for 
intergenic sequences. The GC compositions 
across these classes are very similar to those 
found in humans and mice, and are consistent 
with earlier predictions about the clustering of 
coding sequences in GC-rich domains in the 
genome. It is noteworthy that introns, 3' UTRs 
and intergenic sequences all have low GC con¬ 
tents of around 40%, whereas coding sequences 
have approximately 50% GC. The GC content 
of 5' UTRs is close to 60%. Reuter et al. (2008) 
discuss the reason why 5' UTRs might evolve 
biased GC contents. 


Centromeres and centromeric 

DNA repeats 

Kinetochores are complex eukaryote structures 
that assemble on centromeric DNA and are 
essential for regular chromosome disjunction. 
Some eukaryotes, such as the yeast, 
Saccharomyces cerevisiae , have short centro¬ 
meric sequences, which specifically recognize 
centromeric proteins, and form point centro¬ 
meres. Other eukaryotes, including mammals, 
have regional centromeres based on very long 
(megabase) tracts of highly repeated, usually 
AT-rich DNA, consisting of 120-180 bp repeats 
(Meraldi et al ., 2006; Frzewloka and Glover, 
2009). However, it is not the direct interaction 
of these repeat sequences within these tracts 
with the kinetochore proteins that recruits the 
kinetochore, but epigenetic modification of the 


histones associated with this DNA (Dalai, 2009). 
A particular histone-H3 variant, called CenH3 
(also CenF-A in mammals), replaces H3 to iden¬ 
tify centric specific nucleosomes. These specially 
marked nucleosomes then initiate the recruit¬ 
ment of over 40 proteins comprising the kineto¬ 
chore. As kinetochores normally assemble in the 
same position, except in rare and exceptional 
cases of neo-centromere formation, there must 
be some signal for this epigenetic mark to be 
made, but this is not yet clearly recognized (Dalai, 
2009). Pericentric transcription in the late G2 
early M phase of mitosis is correlated with an 
increase in CenH3 levels. The requirement of 
the usually trancriptionally quiet heterochromatic 
environment for centromere formation is sup¬ 
ported by several cases of neo-centromeres that 
have occurred in gene-poor regions with little 
transcriptional activity. 

Two distinct centromeric repeat families 
have been characterized in the pig. Jantsch et al. 
(1990) have described the Mcl family of repeats, 
restricted to the metacentric chromosomes in 
the pig karyotype, and the Ac2 family, whose 
distribution is restricted to the acrocentric chro¬ 
mosomes. The repeat unit of the heterogeneous 
Mcl family is about 100bp. Some family mem¬ 
bers occur in the pericentromeric region of all 
metacentric chromosomes, whereas others are 
found only on specific metacentric chromo¬ 
somes. The Ac2 family is homogeneous and 
found in the subterminal pericentromeric regions 
of all acrocentric chromosomes. The Ac2 family 
is derived from a highly conserved 14 bp repeat 
unit. Jantsch et al. (1990) have speculated that 
the association of the centromeres of porcine 
acrocentric, but not metacentric, chromosomes 
observed during the pachytene of meiosis may 
allow interchromosomal exchanges that have 
maintained the homogeneity of the Ac2 family. 
Miller et al. (1993) have independently isolated 
what appears to be another member of the Me 
family of repeats and showed that it hybridizes to 
the centromeres of all chromosomes except the 
Y under conditions of low stringency, but only to 
certain metacentric chromosomes under condi¬ 
tions of high stringency. 

Apart from the centromeric repeats, 
McGraw et al. (1988) have isolated a 3.8kb 
repeat element with 80% sequence identity to 
an element independently isolated by Mileham 
et al. (1988). This element is male specific, 
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at least at the level of sensitivity of dot blot 
and Southern hybridization, with more than 
200-fold greater copy number in males than 
females. It contains no internal repetitions and 
no open reading frames. Akamatsu et al. 
(1989) subsequently isolated and sequenced a 
1.2 5 kb repetitive element visible in EcoRl 
digests of both male and female porcine 
genomic DNA. This contains imperfect internal 
repetitions of about 44 bp and has been used as 
a gender-neutral probe for verifying the pres¬ 
ence of target DNA in embryo sexing assays. 


Dispersed repetitive elements 

Transposable elements comprise an important 
group of dispersed repetitive elements, which 
have important biological features, including 
the capacity for insertional mutagenesis and at 
least a theoretical capacity for horizontal trans¬ 
mission. They can be classified into two cat¬ 
egories, namely DNA-mediated elements (class 
2 transposons or DNA transposons) and RNA- 
mediated elements (class 1 transposons, that is 
retrotransposons and retroposons). In mam¬ 
mals, RNA transposons comprise 90-95% of 
transposable elements, whereas in insects, 
plants and protozoa, DNA transposons pre¬ 
dominate (Feschotte and Pritham, 2007). The 
glimpse of the porcine genome provided by 
the Wernersson et al. (2005) partial sequence 
indicates that 92.6% of porcine transposable 
elements are retrotransposons. If SINEs are 
included in the RNA-mediated class, this pro¬ 
portion increases to 95.2%. 

DNA-mediated elements 

DNA-mediated elements transpose via a DNA 
intermediate, generally using a cut-and-paste 
mechanism, although some use a rolling circle 
method of replication and others use methods 
not yet understood (Feschotte and Pritham, 
2007). Twelve superfamilies of DNA trans¬ 
posons have been recognized (Feschotte and 
Pritham, 2007), although only five of these, 
comprising 120 families, have been found in 
humans. Comparisons across many well-charac¬ 
terized mammalian species, including humans, 
mice, rats and dogs, indicate that DNA transposons 


have been relatively inactive in mammalian 
genomes for the past 40-50 million years, 
although there are some rare exceptions, such as 
species of bats, which have a distinct spectrum of 
DNA transposons, some of which still appear to 
be active (Feschotte and Pritham, 2007). 

DNA transposons were discovered inde¬ 
pendently in Drosophila and maize as a result of 
the mutational effects of insertion of the 
elements, but have only been found in mammals 
through sequence analysis. Morgan (1995) first 
discovered sequences in humans that are quite 
closely related to the mariner transposon of 
Drosophila, and are known also to be present in 
a wide range of arthropods and other inverte¬ 
brates. Mariner- like elements (MLEs) are about 
1250 bp in length with inverted repeats of 
20-40bp. Auge-Gouillou et al. (1995) have 
identified MLEs in humans, mice, rats, Chinese 
hamsters, sheep and cattle. Sequence analysis of 
the human, bovine and ovine MLEs, which were 
the most closely related from this group, indi¬ 
cated that they were more similar to the MLEs 
found in Hyalophora cecropia, a lepidopteran, 
than to the mariner elements of Drosophila. 
Indeed, the mammalian and lepidopteran MLEs 
were more similar to each other than to MLEs 
from the two insect groups, although it should be 
pointed out that the mammalian elements were 
isolated with degenerate primers for Cecropia 
MLEs. It has been estimated (Auge-Gouillou et a/., 
1995) that there are more than 100 MLEs in the 
human, ovine and bovine genomes. Most mam¬ 
malian MLEs are thought to be the decaying and 
non-functional remnants of ancient horizontal 
transmission, although Reiter et al. (1996) have 
found indirect evidence that MLE transposase 
could be expressed in humans. 

In pigs, MER1 type and MER2 type ele¬ 
ments comprise approximately 0.9% and 0.3% 
of the total sequence respectively (Wernersson 
et al., 2005); the MER (medium reiterated 
repeat) classes are arbitrary DNA transposon 
classes recognized by the repetitive DNA 
screening software RepeatMasker (Smit et al., 
1996-2010). 

RNA-mediated elements 

RNA-mediated elements (retrotransposons, 
and also retroposons) comprise the other 
major category of transposable elements and 
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transpose via an RNA intermediate, usually 
with the original inserted copy remaining in 
position. They use reverse transcriptase, an 
enzyme that synthesizes DNA from an RNA 
template, and makes a DNA copy from 
the RNA intermediate before integration of 
the element into the genome. The two main 
categories of retrotransposons are the retro¬ 
viruses, which have LTRs, and the LINEs, 
which do not. Retrotransposons encode their 
own reverse transcriptase, as well as inte- 
grase and other enzymes necessary for trans¬ 
position. Retroposons do not encode reverse 
transcriptase and thus are non-autonomous, 
depending on reverse transcriptase activity 
provided by retroviruses or LINEs. SINEs 
have neither LTRs nor reverse transcriptase 
and generally have more or less degenerate 
poly(A) tails. Like pseudogenes, they must 
depend on reverse transcriptase and an inte¬ 
gration mechanism parasitized from LINES 
or retroviruses (Singer, 1982; Ohshima and 

Okada, 2005). 

retroviruses. Endogenous retroviruses, 
mostly defective or incomplete, comprise a 
substantial proportion of the genomes of 
most mammals. Analysis of the initial human 
genome sequence assembly (IHGSC, 2001) 
found that there are about 450,000 
retrovirus-like elements in the human 
genome comprising about 8% of the total 
DNA, although a large proportion of these 
are retroviral fragments, and very few 
complete retroviruses are present. The 
preliminary analysis of the incomplete 
genome sequence of the pig (Wernersson 
et al., 2005) has found that 2.8% of the 
genome sequences consist of retroviral-like 
elements. It is unclear whether this lower 
estimate in pigs reflects a biological reality 
or is an artefact of the incompleteness of the 
sequence or its assembly, or is due to 
ambiguity of the definition of what comprises 
a retroviral sequence. Respective comparison 
between pigs and humans for LINEs (16% 
versus 21%) and SINEs (11.3% versus 13%) 
but not for DNA transposons (1.5% versus 
3%) suggests that perhaps pigs may actually 
have a lower occurrence of endogenous 
retroviral DNA in their genomes than 
humans. 


As a class of retrotransposon, retroviruses 
encode their own reverse transcriptase, inte- 
grase and other enzymes necessary for trans¬ 
position. Complete retroviruses also encode an 
env gene, which specifies the envelope pro¬ 
teins required for packaging infectious retrovi¬ 
ral particles. Although these are generally 
capable of completing the infectious viral cycle 
and thus are capable of horizontal transmis¬ 
sion, most complete endogenous viruses are 
stably inherited and are vertically transmitted 
only. Todaro et al. (1974) were the first to 
show that the PK-15 type C retroviruses 
secreted by pig cells in culture are present in 
multiple copies in the DNA of all pig tissues 
and cells. Thus, these retroviruses are endog¬ 
enous and transmitted vertically. Benveniste 
and Todaro (1975) claimed that a related virus 
was found only in close wild relatives within the 
Suidae, such as the bush pig and the warthog, 
and was absent from the peccary and other 
artiodactyls such as cattle. 

Interest in porcine retroviruses has been 
greatly intensified because they are considered 
to be a potential zoonotic hazard whose poten¬ 
tial impact could be amplified if xenotransplan¬ 
tation of porcine tissue into humans were to 
become a clinical reality. A literature too large 
to review completely in this chapter has devel¬ 
oped. Briefly, FERVs are members of the 
Gammretroviridae. Members of two subfamilies, 
yl and y2, have been extensively characterized. 
The y2 viruses have also been called FERV-E, 
owing to their similarity to the human endogenous 
retrovirus (HERV)-E. There are estimated to be 
up to 50 copies per host genome. These 
endogenous viruses have homogeneous enve¬ 
lope sequences; most are defective and they are 
believed to be of low infectious potential to 
humans (Klymiuk et al ., 2006), so are now 
considered to be of little relevance to 
xenotransplantation. 

yl PERVs have generated an enormous 
level of interest ever since Patience et al. 
(1997) showed that at least some of them were 
capable of infecting human cells in vitro, yl 
retroviruses are classified into PERV-A, B and 
C subfamilies on the basis of very distinct enve¬ 
lope gene sequences. There are up to about 50 
copies of more or less intact endogenous retro¬ 
viruses per host genome, but most of these are 
defective (for example Lee et al ., 2002), with 
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A and B most common and C sometimes 
absent. Tropism, that is, the preference for 
replication in cells from some species or tissues 
rather than others, is determined by the enve¬ 
lope gene sequence. Novel viruses and novel 
envelope sequences can be generated by 
recombination, and indeed the FERV most 
capable of infecting human cells is a de novo 
A/C recombinant (Oldmixon et ai, 2002). 
The possibility of recombination between 
human and porcine retroviruses has been 
argued as a remote but potentially catastrophic 
risk, but such recombinants have not been 
observed. Pseudotyping, the packaging of a 
pig retroviral genome in a human retroviral 
envelope or vice versa, a prerequisite for 
recombination, has not been observed. Indeed, 
in vivo infection of humans with porcine 
retroviruses has never been observed. The 
development of lines of PERV-C free pigs such 
as the Westran (Moran, 2008), incapable of 
producing A/C recombinants, together with the 
use of siRNA molecules targeting endogenous 
retroviruses in transgenically modified pigs 
(Dieckhoff et ai, 2008; Ramsoondar et a/., 
2009), is contributing to increasing confidence 
in the safety of xenotransplantation. 

lines. LINEs are a class of retrotransposon 
and are distinguishable from endogenous 
retroviruses mainly by their lack of env 
sequences and LTRs. Although LINEs encode 
reverse transcriptase, only a very small 
proportion of the elements in the genome 
contain a fully functional reverse transcriptase 
gene which has escaped mutational 
inactivation. Most LINE elements are truncated 
and stranded in their current genomic location, 
where they are doomed to gradually decay 
into random sequence due to unconstrained 
mutation. One LINE family, the mouse LI 
family, is very old, and homologous elements 
have been found in plants and even in protists, 
leading to the suggestion that this is the 
ancestor even of retroviruses, which acquired 
their infectious capability secondarily. About 
21% of the human genome consists of LINE 
sequences, with an estimated 850,000 copies 
(IHGSC, 2001). In the pig, Wernersson et ai. 
(2005) reported that about 16% of their 
incomplete porcine genome sequence 
comprised LINEs, of which LINE1 (14.5%) 


was predominant. In humans and mice, LINE 
elements predominate in the G+C-poor 
G-bands. Miller (1994) described an early use 
of a porcine LINE, which he called LISs, in 
PCR genotyping of pigs for the Hox 2.4 gene. 
Thomsen and Miller (1996) subsequently 
found that this porcine LINE is uniformly 
distributed throughout the euchromatic part of 
the porcine genome, with a slight bias towards 
G-bands. 

sines. The use of the term SINE will be 
restricted here to elements that transpose via 
an RNA intermediate, although it is occasionally 
used in the literature for any small interspersed 
element. Because of their small size, SINEs do 
not encode reverse transcriptase, although 
some are transcribed. SINEs were originally 
discovered in humans (Alu repeats) and mice 
(B1 and B2 repeats) owing to their very high 
copy number in the genome: the Alu repeat 
occurs about 500,000 times in the human 
genome. SINEs are generally less than 200 bp 
in size and are derived from small cellular RNA 
species, which have been reverse transcribed 
and then integrated into the genome. In effect, 
they are processed pseudogenes that have 
attained extremely high copy number. 
Structural RNAs, such as transfer RNAs 
(tRNAs), 5S ribosomal RNA (rRNA) and 7SL 
RNA (a component of signal recognition 
particle ribonucleoprotein), have independently 
become the target for reverse transcription 
and integration in humans and mice to form 
SINEs. It is believed that certain mutations 
within structural RNAs predispose them to 
become templates for reverse transcriptase. In 
particular, nucleotide changes in the 3' region 
of an RNA that cause self-complementarity 
may lead to the formation of hairpin loops, 
and this intra-strand priming allows these 
RNAs to act as templates for synthesis of a 
DNA copy (cDNA - complementary DNA) by 
reverse transcriptase. In the presumably rare 
circumstances of the expression of reverse 
transcriptase in a cell, these mutated structural 
RNAs form effective templates for the synthesis 
of multiple cDNAs that are able to integrate 
into the genome. 

In the pig, SINEs are ubiquitous. Three 
families of SINEs have been recognized, with 
the PRE-1 (porcine repetitive element-1) family 



80 


C. Moran 


(Singer et a/., 1987) being the most abundant 
of these. The consensus PRE-1 element is 
233bp in length, and Singer et al. (1987) have 
estimated that there are 50,000-100,000 cop¬ 
ies interspersed throughout the porcine 
genome. Ellegren (1993a,b) has estimated from 
database searching that PRE-1 occurs about 
once every 12 kb in the porcine genome, imply¬ 
ing that there are 250,000 copies if the ele¬ 
ment is randomly distributed. Other estimates 
(Frengen et a/., 1991; Takahashi et a/., 1992) 
are as high as 500,000. Yasue et al. (1991), 
using a Hinc fragment containing three PRE-1 
elements, discovered evidence for an uneven 
distribution of PRE-1 along the chromosomes, 
surprisingly favouring the vicinity of the centro¬ 
meres. They found no correlation with the 
distribution of Q-bands. In contrast, Sarmiento 
et al. (1993) have shown by in situ hybridiza¬ 
tion that a PRE-1 element, which they called 
Cll, is distributed uniformly throughout the 
porcine genome, but spares the centromeres of 
acrocentric chromosomes. The element used 
by Sarmiento et al. (1993) was cloned from 
within the porcine MHC (major histocompati¬ 
bility complex) and showed a pattern of appar¬ 
ent non-random association with other porcine 
repetitive elements within the MHC, but not 
elsewhere in the genome. It is difficult to recon¬ 
cile these observations unless there are 
subfamilies of the PRE-1 element with different 
patterns of genomic distribution. Thomsen and 
Miller (1996) have also examined the distribu¬ 
tion of PRE-1 and have found no reproducible 
clustering of label, but, in agreement with 
Sarmiento et al. (1993), signal was absent or 
reduced in the centromere regions, particularly 
of acrocentric chromosomes; however, they did 
find some evidence of heterogeneity of 
distribution of particular elements within the 
PRE-1 family, with one element apparently 
preferentially located in strong R-bands. 

The 3' terminus of a PRE-1 element con¬ 
sists of a poly(A) tail of variable length. Miller 
and Archibald (1993) also found that (CA) n . 
(GT) n repeats occur at the 3' termini of PRE-1 
in 12% of cases, whereas Alexander et al. 
(1995) found 24% of their randomly isolated 
(CA) n .(GT) n microsatellite loci to be associated 
with PRE-1. Wilke et al. (1994) found about 
36% of their (CA) n .(GT) n repeats from small 
insert clones to be associated with PRE-1, with 


80% of the (CA) n .(GT) n repeats occurring at 
the 3' end of the SINE. As described elsewhere, 
both the (A) n .(T) n and (CA) n .(GT) n repeats pro¬ 
vide a valuable source of genetic markers. The 
consensus PRE-1 element contains internal 
imperfect direct repeats of about 38 nucleo¬ 
tides and an RNA polymerase III split promoter, 
which is found in all the structural RNAs that 
are transcribed by RNA polymerase III. Singer 
et al. (1987) found PRE-1 to be transcribed in 
liver and thymus, but did not test any other 
tissue. The PRE-1 consensus sequence is 
homologous to tRNA ars genes (Takahashi et al., 
1992), with a similar evolutionary origin to the 
mouse B2 repeat that is derived from tRNA ser 
(Daniels and Deininger, 1985). 

Alexander et al. (1995) reported two 
novel families, ARE-IP and ARE-2P (artiodac- 
tyl repetitive element-1 (or 2) porcine), during 
sequencing of clones containing AC n microsat¬ 
ellite sequences. Related AREs were subse¬ 
quently found in cattle. Alexander et al. (1995) 
estimated the porcine haploid copy number of 
ARE-IP as about 3.6 x 10 4 and about 2.4 x 
10 4 for ARE-2P, with 8 x 10 3 copies appar¬ 
ently being dimers of ARE-IP and ARE-2P. 
These dimeric elements generally enclose a 
(CA) n .(GT) n microsatellite repeat. True stretches 
of polyadenylation were not observed in all 
ARE-IP and ARE-2P elements, although the 
high adenine content implies that a poly(A) tail 
may have been present but has degenerated. 
Direct repeats within the ARE SINEs were 
absent, in contrast to PRE-1. The evolutionary 
origin of the ARE SINEs is not immediately 
apparent, although the tRNAs and 7S1 RNA 
can be ruled out. The ARE SINE consensus 
sequences lack an RNA polymerase III split 
promoter, implying that they are transcribed by 
RNA polymerase II (Alexander et al., 1995), 
which transcribes protein-coding genes and 
the small nuclear ribonucleoprotein genes. 

The PRE-1 family is absent from the 
Bovidae and more distantly related lineages, 
such as mice and humans (Takahashi et al., 
1992). Yasue and Wada (1996) have shown 
that PRE-1 is present in warthogs and the col¬ 
lared peccary at about the same level as in 
domestic pigs. This specificity to the Suidae and 
Tayassuidae implies that it originated after their 
divergence from the ruminants. By examining 
the differences among 22 PRE-1 sequences, 
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Yasue and Wada (1996) estimated the time of 
divergence of these elements from an ancestral 
element as about 43.2 million years before 
present, setting an upper limit on the age of 
these three species. In contrast, the presence of 
ARE SINEs in pigs, cows and possibly sheep 
implies that these sequences were undergoing 
retroposition before the radiation of Suidae and 
Bovidae (Alexander et a/., 1995), and thus are 
much older than PRE-1. 

Wernersson et al. (2005) estimate that 
11.3% of the porcine genome sequence 
comprises SINEs, with the ancient MIRs 
(mammalian-wide interspersed repeats) - which 
are tRNA-derived SINEs which predate the 
mammalian radiation - making up 1.75% of 
the genome. Several laboratories have devel¬ 
oped assays for detection of porcine DNA (in 
the context of meat substitution or mixing) 
which are based on SINE elements, including 
Walker et a/. (2003). Nikaido et al. (1999) 
have reported on the use of SINEs (and LINEs) 
in resolving the monophyletic phylogenetic 
relationship of pigs and peccaries to the 
exclusion of hippopotamus. 

processed pseudogenes. Pseudogenes are 
genomic elements which are non-functional but 
have recognizable similarity to functional genes. 
Processed pseudogenes arise from reverse 
transcription of mRNA followed by integration 
of the cDNA copy. Processed pseudogenes 
are smaller than their parental genes because 
they lack introns, which are spliced out during 
transcription, but they do possess post- 
transcriptionally acquired poly(A) tails. They 
also lack the 5' and 3' regulatory sequences 
necessary for the controlled expression of 
the parental genomic copy of the gene, 
including promoter elements. Other than by 
coincidentally acquiring a new promoter, 
processed pseudogenes are not transcribed. 
Freed of adaptive constraints, they decay 
relatively rapidly owing to the accumulation of 
mutations. Goncalves et al. (2000) have 
estimated that there are up to 33,000 pro¬ 
cessed pseudogenes in the human genome. 
Remarkably, there are apparently only 51 in 
the chicken genome, which is attributed to the 
absence of LINE1 elements (Hillier et al ., 
2004), as the main chicken LINE, CR1, cannot 
reverse transcribe polyadenylated mRNAs. 


Harbitz et al. (1993) described the first 
porcine example of a processed pseudogene, 
derived from the glucosephosphate isomerase 
(' GPI) locus. This is obviously an ancient ele¬ 
ment, because it contains 181 transitions, 78 
transversions, 11 deletions and one insertion, 
and has several stop codons relative to the 
functional cDNA, with which it retains only 
83% nucleotide identity. Assuming a neutral 
mutation rate of 0.7-1.0% per nucleotide per 
million years, Harbitz et al. (1993) estimated 
that the pseudogene arose 20 million years 
ago. The pseudogene has been truncated at 
the 3' end and lacks the poly(A) tail that typifies 
most processed pseudogenes. Only a single 
chromosomal site has been found for this GPI 
pseudogene. 

A more recent example of processed 
pseudogenes in pigs is provided by ribosomal 
protein SA (RPSA), which is involved in the 
binding and internalization of prion proteins. 
Two processed pseudogenes and a single non- 
processed pseudogene have been identified 
and mapped in the pig (Knorr et al., 2007). 
The processed pseudogenes, RPSAP1 and 
RPSAP3 , have three and 13 indels, 36 and 57 
substitutions and six and eight premature stop 
codons, respectively, with respect to the func¬ 
tional gene. The pseudogenes contain LINE 
and SINE-like repeats as well. 

The Pseudofam database (Lam et a/., 
2009) has catalogued pseudogenes, including 
non-processed pseudogenes, in ten species 
with complete genome sequences, including 
the mouse, rat, human, chimpanzee and dog, 
but unfortunately not yet the pig. In humans, 
for example, there are 2.6 pseudogenes on 
average per parent gene, and about 7% of 
genes have pseudogenes. In general, house¬ 
keeping genes are more likely to have pseudo¬ 
genes than other loci. 


Minisatellites 

Minisatellites are tandemly repeated elements, 
with the tandem repeat groups dispersed 
throughout the genome. Together with micro¬ 
satellites, they contradict the artificial dichotomy 
of repeats into tandem versus dispersed classes. 
Minisatellites have repeating units of 9-64 or 
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more nucleotides, and were first described in 
humans (Jeffreys et a/., 1985) but have subse¬ 
quently been discovered in numerous other 
species. The number of core repeat units varies 
enormously at particular sites, providing a 
potentially useful source of genetic markers. 
Because of this hypervariability in tandem copy 
number, minisatellites are also called VNTRs 
(variable number tandem repeats) (Nakamura 

et a/., 1987). 

Coppieters et a/. (1990) discovered a por¬ 
cine minisatellite, consisting of 15/18 bp 
repeat units, while attempting to clone the 
serum protein P02 locus. The core sequence 
showed a high degree of similarity to the 
minisatellite core sequence found by Jeffreys 
et al. (1985) in humans. When used in low- 
stringency hybridization, the clone containing 
the core repeat units detected a highly variable 
fingerprint pattern in pigs, proving multiple 
locations of the element in the genome. It also 
detected a variable fingerprint pattern in horses 
and rabbits. 

Davies et ai (1992) subsequently described 
a minisatellite located within the porcine 
glucosephosphate isomerase locus, with a 
repeating unit of 39 bp (AGGACCAGGGTC 
ATGTACAGGTAGGCAGAGCTGGTCTG), 
flanked by divergent repeats of 53/54bp. The 
core unit had at least 14 perfect tandem copies. 
The repeat unit has no homology with other tan¬ 
dem repeat sequences. The minisatellite appar¬ 
ently has but a single genomic location, because a 
fingerprint pattern was not obtained even under 
conditions of low hybridization stringency. 

Signer et ai (1994) reported a porcine 
minisatellite isolated from a porcine genomic 
library using the human core sequence of 
Jeffreys et al. (1985) as probe. The porcine 
minisatellite hybridized to multiple genomic loca¬ 
tions under conditions of low-to-intermediate 
stringency, but interpretable single-locus 
banding patterns could be obtained by high- 
stringency hybridization and washing. The 
element was found by linkage analysis and in 
situ hybridization to map to the pseudoauto- 
somal region of the X and Y chromosomes. 
No estimation was made of the size of the 
core unit, variation in number of the repeat 
units or number of genomic sites of related 
elements, although allelic variants ranged 
from 3 to 18 kb. 


Coppieters et al. (1994) isolated a porcine 
minisatellite using a CAC/CAT triplet probe. 
Several clones were independently isolated, 
which contained the same 30 bp repeat unit 
(consensus GATGAGGATGGGGGATTGG 
AGATGGATGGA). The minisatellite maps to 
the telomeric region of chromosome 14q. It 
was estimated that there are more than 100 
tandemly repeated units. 

Despite their hyperpolymorphism, minisat¬ 
ellites have fallen into disfavour as genetic mark¬ 
ers in recent years, mainly because they are not 
amenable to FCR, owing to the large repeat 
size and high repeat number. Their preferential 
location in telomeric regions of chromosomes 
(Royle et a/., 1988) was another reason for 
abandoning them as general genetic markers. 


Microsatellites 

Like minisatellites, microsatellites are VNTRs. 
However, they have the advantage of small 
repeat units (1-6 nucleotides) and a reasonably 
small copy number, which makes them very 
suitable for PCR amplification. The extremely 
high number of locations, which appear to be 
randomly located throughout the genome, 
makes them ideal genetic markers, and the 
linkage maps of most mammalian species, 
including the pig, have initially been built 
largely of microsatellite markers. 

Certain categories of microsatellite repeats 
predominate in mammalian genomes, with (AC) n 
repeats being the most widely exploited for map¬ 
ping in mammals. Moran (1993) provided an 
early report on the incidence of all possible 
classes of microsatellite repeats in the pig by sys¬ 
tematically searching for repeat clusters in excess 
of about 20 nucleotides for mononucleotide 
repeats through to tetranucleotides for all por¬ 
cine sequences lodged in the GenBank database. 
The interpretation of the results is complicated 
by the fact that microsatellites were found in 
both cDNA and genomic DNA sequences of the 
approximately 181 nuclear genes represented at 
that time. Nevertheless, an approximate feel for 
the range and distribution of types of repeats 
could be gained. Eight (A) n .(T) n mononucleotides 
and one (C) n .(G) n repeat were found. For the 
dinucleotides, seven (CA) n .(GT) n repeats and 
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three (GA) n .(CT) n were recognized. Five trinucleo¬ 
tides, representing only four of the 20 possible 
types, were found. Finally, nine tetranucleotides, 
representing only six of 60 possible categories, 
were found. The distribution of the elements 
within each category was 27.3% mononucleo¬ 
tides, 30.3% dinucleotides, 15.1% trinucleotides 
and 27.3% tetranucleotides. The (A) n .(T) n mono¬ 
nucleotides, at 24%, and the (CA) n .(GT) n dinucleo¬ 
tides, at 21%, were by far the most common 
categories of specific repeats found. Ellegren 
(1993b) has also searched the DNA sequence 
databases for mononucleotide and dinucleotide 
repeats only, recording the occurrence of runs of 
ten or more nucleotides. He also found (A) n .(T) n 
and (CA) n .(GT) n repeats to predominate, but 
found the (A) n .(T) n repeats to be about five times 
more common than (CA) n .(GT) n dinucleotides. 
Using data from genomic sequences only, he 
estimated that (A) n .(T) n mononucleotides occurred 
about once every 7 kb and (CA) n .(GT) n repeats 
occurred about every 30kb, implying 400,000 
copies of (A) n .(T) n repeats and 100,000 copies 
of (CA) n .(GT) n in the porcine genome. Ellegren 
(1993b) found that one-quarter of the (A) n .(T) n 
repeats were associated with PRE-1 SINEs. 
Neither Moran (1993) nor Ellegren (1993b) 
reported any (AT) n .(TA) n repeats, although Zhang 
et al. (1995) later identified and mapped an 
(AT) n .(TA) n repeat associated with a PRE-1 SINE 
in the 3' UTR of the soluble angiotensin-binding 
protein gene. 

Wintero et al. (1992) have estimated that 
there are 65,000-100,000 (CA) n .(GT) n repeat 
loci in the porcine genome, from the proportion 
of positive clones in a genomic library. Using in 
situ hybridization, they showed the chromo¬ 
somal distribution of these repeats to be more 
or less uniform across all chromosomes, except 
that they were under-represented in the centro- 
meric regions, the q arm of the Y chromosome, 
the telomeres, 16q21, which is a C band posi¬ 
tive interstitial site, and the NOR region on lOp. 
Thus (CA) n .(GT) n microsatellite repeats seem to 
be less common in regions containing other 
categories of repeated DNA. 

The origin of (A) n .(T) n repeats in mammalian 
genomes can be easily explained in terms of ret- 
roposition events, which incorporate poly(A) tails 
into the chromosomal DNA (Ellegren, 1993a), 
although it is far from certain that all (A) n .(T) n 
mononucleotides are generated in this way. 


The gradual decay of poly(A) tails can explain 
the origin of at least some of the less common 
related repeats found, and clearly is responsible 
for the (AT) n .(TA) n repeats mapped by Zhang 
et al. (1995). However, when it comes to the 
more common dinucleotide categories, such as 
(CA) n .(GT) n and (GA) n .(CT) n , it is unclear whether 
this generalization can be extended further. In 
support of the evolution of (CA) n .(GT) n repeats 
from degenerate poly(A) tails is the frequent 
observation by many groups of the contiguity in 
the porcine genome of the SINE element, PRE-1 
and (CA) n .(GT) n repeats, as mentioned previ¬ 
ously. Indeed, Alexander et al. (1995) discovered 
two new elements, ARE-IP and ARE-2P, in the 
process of examining the sequences flanking 
their other (CA) n .(GT) n sequences, finding them 
present in 6% of cases. Assuming that these are 
degenerate poly (A) tails, this means that either 
there is a very large number of heterogeneous 
uncharacterized SINE elements, for which no 
consensus sequence has been derived or, alterna¬ 
tively, that the other 70% of (CA) n .(GT) n repeats 
have evolved by some other mechanism. Because 
replication slippage is widely accepted as the 
mechanism for the generation of diversity of 
microsatellite length, it is conceivable that any 
non-coding sequence containing two or more 
dinucleotide repeats is capable in theory of 
growing via this mechanism to the size where it 
would be recognized as a microsatellite locus. 
However, the lesser propensity of small repeats 
to generate new alleles would argue against this 
hypothesis. 

While microsatellites are still very valuable 
genetic markers for certain applications, such 
as analyses of population structure (Cowled 
et al ., 2008), they have been usurped by single 
nucleotide polymorphisms (SNPs) in high- 
resolution genetic mapping and molecular 
breeding value estimation, where the availabil¬ 
ity of SNPs in huge numbers on easy-to-use 
and relatively cheap genotyping platforms 
makes them the marker of choice. 


Telomeric repeats 

From very early studies of chromosome behav¬ 
iour, it was observed that the ends or telomeres 
of chromosomes had special properties, which 
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protected them from the degradation and 
fusion seen with recently broken chromosome 
ends. It was also realized that conventional 
DNA polymerases, which require priming with 
a short RNA fragment, would be incapable of 
synthesizing DNA right to the ends of discon- 
tinuously growing strands, implying that there 
would be a gradual shortening of the chromo¬ 
somal ends during chromosome duplication. It 
is now recognized that mammalian chromo¬ 
somes are capped by arrays of the hexamer 
(TTAGGG) n (Blackburn, 1991), which can be 
repeated for up to 10-15 kb in humans and for 
100-150 kb in rodents. This repeat is very rare 
elsewhere in the genome. Interstitial sites 
generally are points of evolutionary chromo¬ 
somal rearrangement, such as the telomere 
to telomere fusion that gave rise to human 
chromosome 2 (Ijdo et al., 1991) and the 
interstitial site near the centromere of mouse 
chromosome 6 (Yen et a/., 1995). 

The 3' end of the telomere in mammals is 
single stranded with an overhang of 50-100 
nucleotides. This single-stranded overhang 
loops around to invade the double-stranded 
part of the telomere to form a structure called 
the t-loop (Palm and de Lange, 2008), which 
hides the end of the telomere and protects 
it from damage. However, the t-loop alone 
is insufficient protection, and a number of 
proteins also bind to the telomere to protect it 
from nucleases and to distinguish it from a 
broken chromosome end. Three of the pro¬ 
teins of the six-protein shelterin complex, 
Telomere Repeats binding factors 1 and 2 
(TRF1 and 2) and Protection of Telomeres 
1 ( POT1 ), specifically recognize and bind to 
TTAGGG, with TRF 1 and 2 binding first and 
then recruiting the other three factors (Palm 
and de Lange, 2008). 

In germline cells, the telomere repeats are 
synthesized by a distinct replicative mechanism 
from normal genomic DNA, using a ribo- 
nucleoprotein reverse transcriptase enzyme 
called telomerase (Morin, 1989). Telomerase 
contains RNA complementary to the telomeric 
repeats which it synthesizes, and thus is not 
dependent on the telomeric DNA template. 
Telomerase is not expressed in somatic cells in 
mammals. Consequently, the duplication of 
chromosome ends by conventional polymer¬ 
ases will lead to a reduction in the number of 


copies of the telomeric repeats, and it has been 
observed that the number of copies of telo¬ 
meric repeats decreases as an animal ages. 
However, attempts to use telomere length to 
estimate the age of animals have generally 
been disappointing. 

Meyne et al. (1989) demonstrated the 
conservation of the (TTAGGG) n telomeric 
sequence among 91 species of vertebrates, 
including fish, amphibians, reptiles, birds and 
mammals, by in situ hybridization of the 
repeat to the telomeres of all chromosomes in 
all the species examined, although the pig 
was not among the 70 mammalian species 
tested. Subsequently, Gu et al. (1996) have 
demonstrated, using primed in situ synthesis 
(PRINS), that this sequence resides at the tel¬ 
omeres of all porcine chromosomes. They 
also found an interstitial site at chromosome 6 
q21-q22, at a point where there is a disjunc¬ 
tion in the syntenic relationship between pig 
6 and human lp and 19q chromosomes. This 
site of hybridization gradually disappeared 
with increasingly stringent conditions of 
hybridization of the primer, implying that the 
internal site is non-functional and has under¬ 
gone gradual mutational decay. Gu et al. 
(1996) interpreted this interstitial position as 
evidence for a tandem fusion in the origin of 
porcine chromosome 6. 

Expressed repetitive sequences 

Mammals, like other organisms, contain some 
repeated sequences that are expressed and 
have important cellular functions. In some 
cases, the repetition of the gene is necessary to 
ensure adequate levels of expression of the 
gene product. In other cases, duplication of the 
gene locus has permitted evolutionary experi¬ 
mentation as the duplicate is able to diverge 
and evolve new functions. Infrastructure RNAs, 
such as rRNAs, are an important class of 
expressed repeats. 

Ribosomal RNA (rRNA) 

rRNA is a structural component of ribosomes 
and thus comprises a vital component of the 
cellular protein synthesis apparatus. In eukary¬ 
otes, the ribosome consists of a small subunit 
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(40S) containing the 18S rRNA and a large 
subunit (60S) containing the 28S, 5.8S and 
5S rRNA subunits. The 28S, 18S and 5.8S 
genes are clustered and expressed coordi- 
nately. Cells must express rRNAs at extremely 
high levels to meet the cellular demand for 
protein synthesis, and have a special enzyme, 
RNA polymerase I, dedicated specifically to 
this task. rRNA generally comprises about 
80% of the RNA found in a cell. Multiple cop¬ 
ies of the rRNA genes expedite transcription. 
In humans, there are 150-300 copies of the 
18S/5.8S/28S rRNA cluster distributed 
between five autosomes, with the number of 
copies varying between individuals and chro¬ 
mosomal sites. The positions of the tandemly 
repetitive ribosomal gene clusters, which are 
separated by untranscribed spacers, are cyto- 
logically visible at features called secondary 
constrictions or nucleolus organizer regions 
(NORs). Their expression is controlled by a 
promoter that directs transcription of all 
members of the cluster as a 45S pre-rRNA 
transcript, which is processed into the 18S, 
5.8S and 28S subunits. 

In the pig, three sites of the rRNA 
18S/5.8S/28S cluster, called RNR1, RNR2 
and RNR3, are located on chromosome 8pl.2, 
1 Op 1.2-1.3 and 16q2.1 respectively (Czaker 
and Mayr, 1980; Miyake etal., 1988; Popescu 
et a/., 1989; Bosma et a/., 1991). 

The 5S (about 120 bp) rRNA genes are 
also tandemly repeated and separated by 
untranscribed spacers. However, in humans, 
they are transcribed by RNA polymerase III 
and map to quite different locations from the 
28S/18S/5.8S cluster, having a major site on 
human chromosome lq42.11-42.13, with a 
minor site nearby at lq31 (Lomholt et a/., 
1995a). Likewise the 5S rRNA genes in pigs 
map to 14q23 (Lomholt et a/., 1995b), 
nowhere near the other rRNA genes. 

Ling and Arnheim (1994) have character¬ 
ized the promoter region of the 18S/5.8S/28S 
cluster in the pig and compared it with equiva¬ 
lent mammalian promoters. This revealed con¬ 
servation of the regulatory regions found in 
other mammalian promoters. The only unu¬ 
sual feature was that there was a C nucleotide 
16 bp upstream of the transcription start point 
in the pig, whereas G is conserved in all other 
eukaryotic promoters. 


Transfer RNAs (tRNAs) 

Like 5S rRNA, tRNAs are transcribed by RNA 
polymerase III. As for rRNAs, the very high 
cellular demand for tRNAs can be met only by 
transcription from multiple copies of the genes. 
Generally there are ten to several hundred 
genes for each tRNA. Interestingly, the repeat 
clusters contain genes for several different 
tRNAs. For example, in the rat, tRNA Asp , 
tRNA Gly and tRNA Glu form a cluster, which is 
tandemly repeated about ten times. The mem¬ 
bers of the tandem repeats are not identical, 
and some may be pseudogenes. Very little has 
been reported about the tRNA genes of the 
pig, but it can be safely assumed that they fol¬ 
low a typical mammalian pattern of dispersed 
distribution and duplication which will be fully 
revealed by a complete genome sequence. 

Other multigene families 

In mammals, there are many protein-encoding 
gene families in which the members show 
developmental or tissue specificity of expres¬ 
sion. For example, the globin gene family con¬ 
sists of members coordinately expressed at 
embryonic, fetal and adult stages, and includes 
pseudogenes. The a-actin family consists of 
members whose expression is specific for skel¬ 
etal muscle or for cardiac muscle. In some 
cases, the members of these gene families are 
located very closely in the genome, whereas in 
other more heterogeneous gene families, the 
members may be dispersed. The individual 
members of these gene families are generally 
structurally similar to the single-copy genes 
discussed later. 

functional genes. In the pig, numerous 
multigene families have been recognized. 
For example, the tumour necrosis factor 
alpha (TNFA) and tumour necrosis factor 
beta (TNFB) loci are located within the MHC 
and separated by a distance of only 2.5kb 
(Chardon et a/., 1991). ATP1A1 and 
ATP1B 1, the alpha 1 and beta 1 polypeptides 
of the Na + , K + -ATPase, also comprise 
dispersed members of a putative gene 
family on chromosome 4 (Lahbib-Mansais 
et a/., 1993), which is conserved in humans 
and mice. 
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The casein family of milk proteins has been 
thoroughly characterized in the pig and retains a 
similar structure to that found in other mam¬ 
mals. Four members, namely a SI (Alexander 
and Beattie, 1992a), a S2 (Alexander et al., 

1992) , (3 (Alexander and Beattie, 1992b) and 
k (Levine et al. , 1992), have been mapped to 
chromosome 8 and the first three at least have 
been shown to be very tightly linked (Archibald 
et a/., 1994). 

Two very well characterized porcine multi¬ 
gene families, namely the protease inhibitors 
(PI) and the immunoglobulin heavy chain (IGH) 
gene families, coincidentally map near each 
other on porcine chromosome 7. P/1, POIA, 
POIB, PI2, PI3 and PI4 form a tight linkage 
group adjacent to IGH1, IGH2, IGH3 and 
IGH4 (Gahne and Juneja, 1986; Juneja et al., 
1986; Van de Weghe et al., 1987; Vogeli 
et al., 1987; Stratil et al., 1990; Cizova et al., 

1993) on 7q24-26 (Gu et al., 1994; Musilova 
et al., 1995). POIA and POIB, originally 
called postalbumin 1A and IB, respectively, 
appear to be cysteine protease inhibitors 
(Gahne and Juneja, 1986). P/1, PI2, PI3 and 
P/4 are serine protease inhibitors, with differ¬ 
ent loci and even alleles at particular loci being 
identified as trypsin or chymotrypsin inhibitors. 
Stratil et al. (1990) noted that there are many 
weakly staining alleles and null alleles at the 
PI3 locus, which is consistent with the evolu¬ 
tionary expectation within a multigene family 
containing redundant members. 

With increased scrutiny of the porcine 
genome, more gene families are being discov¬ 
ered and soon all will be revealed by a com¬ 
plete genome sequence. An interesting 
example is provided by the kallikreins, which 
are serine proteases expressed in a diverse 
range of tissues, and are often involved in acti¬ 
vating proteins from an inactive precursor. In 
humans, there are 15 members of this gene 
family mapping to HSA 19ql3.3-ql3.4. In 
the mouse there are even more, between 24 
and 37, of which 14-26 are pseudogenes, all 
co-located on MMU7. Fernando et al. (2007) 
have reported 13 members of this family in the 
pig, mapping to SSC6q 12-21. Orthology 
could be determined for all members of the 
family between human and pig, except for 
KLK2 and KLK3, which are absent from the 
pig and also from the mouse. The organization 


of the members of the gene family is similar 
across these three species, with all genes in all 
species oriented in the same transcriptional 
orientation from telomere to centromere. 

miRNA families (miR) constitute a totally 
novel class of gene families. These families 
consist of members that differ by only one or 
two nucleotides, but given the small size of 
miRNAs, at about 22 nucleotides, there is 
limited scope for variation. Reddy et al. (2009) 
reported evidence in the pig for all three mem¬ 
bers of the miR-1 family, miR-la, miR-lb and 
miR-lc, which are present in a diverse range of 
species. Evidence was found also for all ten 
members (a to j) of the let-7 family in the pig, 
although there is some nomenclatural confu¬ 
sion for miRNAs, as miR-98, reported by 
Reddy et al. (2009), differs by only one nucleo¬ 
tide from the let-7 family and should really be 
considered a member of it. 

pseudogenes. Pseudogenes can arise from 
duplication of a locus followed by an inactivating 
mutation, such as a frameshift, the incorporation 
of a stop codon or the loss or mutation of 
promoter elements. Like the processed 
pseudogenes already described, these duplicate 
elements are no longer subject to adaptive 
evolutionary constraint and diverge relatively 
rapidly from the parental sequences. Such 
divergent, non-functional loci are frequently 
recognized as ‘degenerate’ members of gene 
families. 

Vage et al. (1994) have reported a pseu¬ 
dogene within the porcine MHC. The MHC 
class II DRB locus was found to have an 
expressed version, called DRB1 , and a pseudo¬ 
locus, called DRB2, present in about 60% of 
chromosomes. DRB2 has a single nucleotide 
deletion in codon 54, which would cause a 
frameshift and total loss of function. Ironically, 
the strong selection pressure for MFIC diversity 
has meant that the functional DRB1 was found 
to be much more variable than the pseudo¬ 
gene. Brunsberg et al. (1996) have confirmed 
the findings of Vage et al. (1994) and have 
discovered an additional DRB pseudogene. 
Both were detected using primers that span 
intron-exon boundaries. Thus, the pseudogene 
is clearly a degenerate member of a gene fam¬ 
ily. Similarly, Mege et al. (1991) have described 
a pseudogene within the interferon family, 
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which they called psi IFN-alpha II-1 , which has 
multiple frameshift mutations. More recently, 
Knorr et al. (2007) have described RPSAP2, a 
non-processed pseudogene of the ribosomal 
protein SA (RPSA), believed to have arisen by 
segmental duplication. This pseudogene, which 
has introns unlike the processed pseudogenes 
of RPSA described earlier, has seven indels, 25 
substitutions and ten premature stop codons 
compared with the RPSA coding sequence. It 
also has seven SINE (tRNA-Glu) and three 
DNA transposons inserted into its UTRs and 
introns. Clearly the evolutionary experimenta¬ 
tion following gene duplication can lead in 
some cases to functional novelty, but is proba¬ 
bly more likely to lead to loss of function and 
decay of the gene. 

Copy number variants and segmental 

duplication 

Although duplications or deletions of relatively 
large segments of chromosomes have been 
known ever since the Bar eye phenotype in 
Drosophila was attributed to a cytologically 
visible duplication, genome-sequencing 
projects have revealed that copy number vari¬ 
ants (CNVs), frequently attributable to seg¬ 
mental duplications, are common in the 
genomes of many organisms and may have 
important phenotypic consequences. Fadista 
et al. (2008) have addressed the occurrence 
of CNVs on parts of SSC4, 7, 14 and 17, 
using a preliminary assembly of parts of the 
porcine genome sequence, to gauge the fre¬ 
quency of CNVs greater than 1 kb in a sample 
of 12 boars. They identified 37 validated 
CNVs, with five overlapping segmental dupli¬ 
cations and several others overlapping known 
genes, and putatively reflecting gene family 
expansion or deletion. Use of a more com¬ 
plete and better assembled genomic sequence 
and sampling of more animals will obviously 
reveal many more of these features in the 
porcine genome. Fadista et al. (2008) esti¬ 
mated that about 0.18% of the porcine 
genome is tolerant of copy number variation. 
Du et al. (2009) have found evidence that 
segmental duplications in pigs have dispro¬ 
portionately affected genes involved in lipid 
metabolism, putatively related to the history 
of selection for fatness in pigs. 


Single-copy DNA 

Many single-copy protein-encoding genes 
have now been cloned and characterized in 
the pig, and it is well beyond the scope of this 
review to attempt to cover all of them. Some 
will be discussed in other chapters. Porcine 
genes appear in all respects quite typical for 
mammalian genes. The porcine calcium 
release channel (CRC) locus, which was the 
subject of an intensive search by several labo¬ 
ratories, has an extremely large message of 
over 15kb (Fujii et al ., 1991), but so also do 
the human and rabbit homologues (Zorzato 
et al., 1990). The size of the entire genomic 
copy of this locus, including introns, has not 
been determined, but is likely to be about 
120 kb, given that 25,780 nucleotides of 
genomic DNA are required to specify only 
3305 nucleotides of the porcine message 
from nucleotide 4624 to 7929 (Leeb et al., 
1993). Protein-encoding sequences account 
for only 13% of the genomic DNA in this 
region, again emphasizing the large propor¬ 
tion of the genome that is non-coding and of 
unknown function, even in the vicinity of cod¬ 
ing genes. Seemann et al. (2007) have 
reported the detection of 18,621 protein¬ 
coding RNAs in the Sino-Danish PigEST 
database, which is likely to be only slightly 
short of the number of protein-encoding 
genes in this species. 


Gene Structure and Function 

Protein-encoding genes 

The porcine growth hormone gene demon¬ 
strates the structural and functional features of 
a typical protein-encoding gene. The porcine 
gene retains extensive sequence similarity with 
the human, rat and bovine homologues, even 
in the non-coding promoter, intronic and 
untranslated regions (Vize and Wells, 1987). It 
is instructive to review the regulatory and struc¬ 
tural features of this locus in the sequence of 
Vize and Wells (1987). Many regulatory fea¬ 
tures held in common with other mammalian 
genes can be seen. Starting at position -30 
from the cap site is the sequence TATA AAA, 
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the Goldberg-Hogness box, required for cor¬ 
rect positioning of RNA polymerase II and ini¬ 
tiation of transcription. An adenine nucleotide 
at position +1, the cap site, marks the start of 
the messenger RNA, which is capped by the 
addition of 7-methylguanylate in an unusual 
5-5 linkage by a capping enzyme. This cap¬ 
ping protects the RNA from degradation. The 
initiating methionine codon, ATG, which marks 
the start of translation of all eukaryotic genes, 
commences at position +64. Thus the porcine 
growth hormone message has a 5' UTR 
(untranslated region) of 63 nucleotides. Only 
ten coding nucleotides occur before the start of 
the first intron. It is virtually universal for the 
splice donor site, that is the first two nucleo¬ 
tides of an intron, to be GT, but here the 
242 bp intron commences with GC, which is 
very unusual. However, intron 1 terminates 
with a typical splice acceptor dinucleotide, AG. 
Exon 2 is 162 nucleotides long. Intron 2, which 
is 210 nucleotides long, has typical splice 
donor and acceptor sites, namely GT and AG, 
respectively, as do all the other introns in this 
gene. Exon 3 is 117 nucleotides in length and 
is followed by intron 3 of 197 nucleotides. 
Exon 4 is 162 nucleotides in length, followed 
by intron 4 of 278 nucleotides. Exon 5 consists 
of 198 nucleotides that specify amino acids fol¬ 
lowed by the translation STOP codon, TAG. 
Following this coding sequence is a 3' UTR of 
102 nucleotides. The sequence AATAAA, 
which is a polyadenylation signal, is 80 nucleo¬ 
tides after the end of the stop codon. This 
directs an enzyme called poly(A) polymerase to 
add a poly(A) tail to the end of the message at 
a point 16 nucleotides downstream from the 
end of the signal. The addition of the poly(A) 
tail is not template determined and is essential 
for the stability of the mRNA. Of the 1745 
nucleotides from the cap site to the start of the 
poly (A) tail, there are 1092 nucleotides of non¬ 
coding DNA in UTRs and introns, comprising 
63% of the sequence. Although some of this 
non-coding DNA has recognized function, for 
example the polyadenylation signal, it is worth 
emphasizing again that, as for other loci in 
mammals, only a relatively small proportion 
specifies amino acids, and this proportion 
becomes even smaller if all sequence from the 
start of the 5' to the end of the 3' regulatory 
sequences are considered. 


Of course, there are other 5' and 3' regu¬ 
latory sequences that are essential for the 
tissue-specific and temporally regulated expres¬ 
sion of any gene, such as growth hormone. 
In general, these include positive and negative 
proximal promoter elements, which are bind¬ 
ing sites for transcription factors, which either 
assist or hinder directly or indirectly the binding 
of RNA polymerase II. Additionally, there are 
enhancer elements, which are large sequences, 
often quite distant, and either upstream or 
downstream from the gene, which increase the 
level of transcription, and silencer elements, 
which are short, often repeated, sequences 
that downregulate expression. 


Somatically rearranged genes 

Unlike conventional genes, the immunoglobulin 
and T cell receptor loci show extensive patterns 
of somatic rearrangement as a normal part of 
the diversity-generating mechanism, which 
enable the huge range of immune responses 
found in mammals. In essence, gene families of 
constituent parts at these loci are assembled 
by DNA breakage and joining events into more- 
or-less unique combinations within committed 
lineages of immune cells. The control of the 
transcription of these newly assembled func¬ 
tional units and the processing of transcripts 
occurs as for conventional genes. 

The antibody molecules produced by B 
cells comprise light chains fitting into two major 
categories, X and k, and heavy chains, consist¬ 
ing of five major types, a, 5, 8, y and pi, which 
correspond to the five classes of antibodies, IgA, 
IgD, IgE, IgG and IgM. The light-chain families 
map to two locations in human and mouse, 
whereas the five heavy-chain ‘loci’ map to a 
single location, which, as mentioned previously, 
has been identified in the pig. To make a X or 
k light chain, a B cell must undergo a process of 
combinatorial (V-J) joining in which a unique 
combination of four exons is assembled by 
chromosomal breakage and joining events. These 
exons are called L (leader) and V (variable), of 
which there are several hundred adjacent but dif¬ 
ferent pairs, J (joining), of which there are four to 
five variants, and C (constant), of which there is 
normally one copy. Many hundreds of different 
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light chains can be specified in this way. Heavy- 
chain diversity is generated in a similar but 
slightly more complex way. As for light chains, 
there are about 250 different L and V exon 
pairs, followed by about 12 D (diversity) exons, 
followed by about four J exons, followed by 
multi-exonic constant regions corresponding to 
all of the major classes and subclasses of anti¬ 
bodies. The construction of a functional heavy- 
chain gene requires breakage and rejoining 
events to assemble a unique combination of 
L-V-D-J-C coding regions via two events, 
namely D-J joining followed by V-DJ joining. 
Again, many hundreds of different heavy chains 
can be produced in this way, which in combina¬ 
tion with the many hundreds of different possi¬ 
ble light chains yields great antibody diversity. 

In T cells, which are responsible for cell- 
mediated immune responses, the antigen 
receptors are composed of a, (3, y and 5 subu¬ 
nits, which are encoded at different chromo¬ 
somal locations. The normal adult receptor is 
an 6(3 dimer in humans and mice, but y6 dimers 
are more prominent in pigs and are found on 
up to half the T cells. The functional copies of 
these genes are assembled by a joining mecha¬ 
nism similar to immunoglobulin chains, with a 
and y chains having V and J segments, and p 
and 6 having VDJ segments, in addition to the 
constant segments. Thus, both for antibodies 
and for T cell antigen receptors, somatic rear¬ 
rangements are a normal part of the cycle of 
expression of these gene products, necessi¬ 
tated by the enormous diversity of products 
required and the impossibility of encoding all 
possible products in a conventional fashion. 

Although most characterization of these 
immune gene systems has been performed on 
mice and humans, many elements of the proc¬ 
ess have been verified in pigs. Lammers et al. 
(1991) have sequenced most of the V and all of 
the J and C regions of the porcine k and X light 
chain genes and have found that the J region is 
more closely conserved across species than the 
constant regions. The porcine k light chain is 
unusual in that the constant region terminates 
with the dipeptide Glu-Ala after the Cys resi¬ 
due that terminates all other mammalian light 
chains. Kacskovics et al. (1994) have identified 
five putative subclasses of porcine IgG loci (1 , 
2a, 2b, 3 and 4) on the basis of C-region 
sequences from a single animal. Brown and 


Butler (1994) sequenced the constant region of 
the porcine IgA heavy chain and have shown 
that unlike the human and the mouse, which 
have two or more loci, the pig has only a single 
C gene, with the sequence and structure con¬ 
served relative to other species particularly 
towards the carboxy terminus of the gene 
product. Bosch et al. (1992) have similarly 
characterized a porcine clone for the (i heavy 
chain. Sun et al. (1994) have cloned and 
sequenced 65 H chain V regions expressed 
with IgG, IgA or IgM C regions. They found 
that the porcine V regions belonged to a homo¬ 
geneous family with greater than 80% nucleo¬ 
tide identity among members, and that there 
were probably fewer than 20 copies of the V 
regions in the pig. Surprisingly, the J region 
was also very homogeneous, suggesting that a 
single J region is preferentially used. In other 
mammals, there are three families of V genes, 
with mice having 100 to 1000 copies, humans 
100 to 120 copies and rabbits 100 copies. 
The relatively depauperate V status of pigs 
raises questions about how they can generate 
sufficient antibody diversity. 

The cDNA clones of the porcine T cell 
receptor a, p, y and 6 chains have been char¬ 
acterized (Thome et a/., 1993; Grimm and 
Misfeldt, 1994). Thome et al. (1993) found 

evidence for at least three different constant 
segments for the porcine y chain family. There 
is substantial conservation of constant segment 
coding sequence within the four subunits across 
pigs, cattle, mice, sheep and humans. 


Non-coding RNAs (ncRNAs) 

Following the discovery of very large numbers 
of ncRNAs in the mouse transcriptome (The 
FANTOM Consortium and the RIKEN Genome 
Exploration Research Group Phase I & II Team 
et a/., 2002), it is now recognized that most of 
the genome in mammals is transcribed in a 
controlled manner. Indeed, transcription from 
non-coding sequences is at least four times 
greater than from coding sequences (Mercer 
et al., 2009), although individual protein¬ 
coding transcripts are expressed at a much 
higher level than non-coding transcripts. The 
approximately 34,000 or so long non-coding 



90 


C. Moran 


but messenger-like RNA molecules that have 
been discovered in the mouse will presumably 
have orthologues in other mammalian species, 
and perhaps should be thought of as a large 
new class of genes to which we need to ascribe 
function. Opinions vary as to the proportion 
that is functional. However, those that are 
expressed in a highly regulated way, with devel¬ 
opmental and tissue specificity and even pre¬ 
cise subcellular location (Mercer et a/., 2009), 
clearly meet the criteria for functionality, 
despite the fact that there is generally poor 
sequence conservation of these elements 
between species. Pheasant and Mattick (2007) 
have argued that it is quite possible that the 
majority, and possibly all, of these sequences are 
functional. The database RNAdb (Pang et a/., 
2005) catalogues such ncRNAs from various 
species. It is believed that ncRNAs generally 
function as trans-acting regulatory factors. 

Seemann et a/. (2007) have searched for 
ncRNAs in the Sino-Danish PigEST resource, 
and identified over 1000 of them. Consistent 
with findings in other mammals, they found 
that ncRNA transcripts were much more abun¬ 
dant in brain tissue than in other less develop- 
mentally complex tissue. 

Micro RNAs (miRNAs) 

miRNAs are short regulatory RNA molecules 
that are about 22 nucleotides in length. They 
can regulate gene expression by interfering 
with translation or causing degradation of tar¬ 
get mRNA molecules, which they do by base¬ 
pairing with the target molecule, most 
frequently in the 3' UTR. miRNAs arise from 
longer transcripts and are trimmed to size. The 
precursor pri-miRNA is from several hundred 
bp to 1000bp long and is capped, polyadenyl- 
ated and spliced, and may actually incorporate 
a number of different miRNA molecules that 
are released following processing. The pri- 
miRNA contains a number of hairpin loops of 
about 70 nt in length, which are cut and then 
processed into functional miRNA. He et ai 
(2008) have identified miRNAs encoded by 
long ncRNAs, estimating that up to 800 are 
present in the mouse ncRNA population. 

Most miRNAs are initially reported on the 
basis of bioinformatic scans, but given the small 
size of miRNAs these scans are susceptible to 


false discovery. Wernersson et ai (2005) iden¬ 
tified 51 miRNA sequences in their partial por¬ 
cine genome sequence by similarity to miRNAs 
in closely related species, but this necessarily 
excludes miRNAs that are specific to the pig. 
Kim et ai (2006) subsequently reported an 
additional nine miRNAs identified by homology 
search. Reddy et al. (2009) found 120 miRNAs 
using 454 sequencing. Comparisons with 
known miRNAs in miRBASE showed that 24 
were previously recognized porcine miRNAs 
and 96 novel miRNAs. They confirmed expres¬ 
sion of 22 conserved miRNAs and four pig- 
specific miRNAs in a range of pig tissues. 
Huang et al. (2008) identified 775 miRNA 
candidates in silico in the partial porcine 
genome sequence, of which 296 gave positive 
signals on a cross-species miRNA microarray 
using developing skeletal muscle, but only five 
of these were validated by real-time PCR; 255 
of the 296 had not been previously reported. 
McDaneld et al. (2009) have also examined the 
profile of miRNA expression in developing 
porcine skeletal muscle, reporting 12 novel 
potential miRNAs. Sharbati-Tehrani et al. 
(2008) applied a novel concatameric cloning 
method to identify ten porcine miRNAs, of 
which nine were novel. Anselmo et al. (2009) 
identified large numbers of miRNAs in porcine 
cells infected with the Alphaherpesvirus respon¬ 
sible for Aujesky’s disease, of which about 95% 
were of host origin. They were particularly 
interested in identifying miRNAs of viral origin 
that regulated host gene expression in the por¬ 
cine cells, as Herpesviruses have been shown to 
produce such miRNAs. Tomas et al. (2009) 
identified 175 potential miRNAs of viral origin 
in porcine cells infected with Aujesky disease 
virus, along with about 200 miRNAs of host 
origin. Feng et al. (2007) mapped six evolu¬ 
tionary conserved porcine miRNA loci using a 
radiation hybrid panel, demonstrating that the 
porcine map locations were consistent with 
human positions for each miRNA, thus provid¬ 
ing an alternative or at least complementary 
validation tool to expression analysis. miRBASE 
(Release 14) lists 721 confirmed miRNAs for 
humans and only 77 for the pig, implying per¬ 
haps a degree of scepticism about the published 
porcine miRNAs, but also demonstrating that 
there is ample scope for further discovery of 
porcine miRNAs. 
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Of course, the discovery of the miRNA is 
only the first step. The next important step is 
discovering the target for miRNA binding. If 
identifying the miRNAs themselves is challeng¬ 
ing enough using bioinformatic approaches, 
because of the small size of the mature miRNA 
molecules, the challenge for finding the targets 
is much greater. Perfect complementarity is 
only required for the first six to eight nucleo¬ 
tides at the 5' end of the miRNA and the 3' end 
of the target and, in some cases, even this per¬ 
fect match is not absolutely necessary. Liu 
et al. (2010) have pointed out that algorithmic 
methods for detecting miRNA targets have 
false discovery rates of up to 30%. Allelic vari¬ 
ation in miRNA target sites has been associ¬ 
ated with phenotypic effects in domestic animal 
species, although none have been verified in 
pigs. Clop et a/. (2006) reported that a muscu¬ 
lar hypertrophy phenotype in Texel sheep was 
the result of the creation of an illegitimate tar¬ 
get for miR-1 and miR-206 in the 3' UTR of 
the myostatin locus. Davis et a\. (2005) have 
implicated maternal expression of five different 
miRNAs in the callipyge muscular hypertrophy 
phenotype in sheep. Cargill et al. (2008) have 
identified a mutation in the 3' UTR of synapto- 
janin that could disrupt let-7 and miR-98 bind¬ 
ing sites, and is possibly responsible for the 
polled trait in cattle. 

miRNAs have also generated enormous 
interest as it is possible to design inhibitory 
RNA molecules to mimic the inhibitory activity 
of the naturally occurring miRNAs, but with 
the possibility of targeting almost any molecule. 
As described previously, such constructs have 
been used to genetically modify pigs to sup¬ 
press the expression of porcine endogenous 
retroviruses, but clearly they can be used in a 
much broader context. 


Base Composition and Methylation 

Protein-encoding genes in humans are found 
preferentially in GC-rich isochores. The GC 
content of human exons also tends to be about 
10% higher than in introns. Wernersson et al. 
(2005) have found that the GC content and dis¬ 
tribution in the pig are very similar to those in 
humans, as discussed above, with a 10% higher 


GC content in coding regions compared with 
introns. These findings have confirmed the 
results of earlier glimpses of porcine GC con¬ 
tent. For example, the GC content of ten of 
the microsatellite-containing clones (excluding 
the repetitive microsatellite component) char¬ 
acterized by Robic et al. (1994), none of which 
is known to encode any function, was 43% 
(1607 from 3768 nucleotides), also consistent 
with the expectation that porcine non-coding 
regions would be GC poor. In contrast, the 
growth hormone locus has a genomic DNA 
GC content of 61% and a cDNA GC content of 
59% GC. Again the expectation of high GC 
content in coding DNA is met. However, not all 
porcine coding genes have a high GC content. 
The genomic sequence of the relaxin gene, 
containing about lkb of 5' flanking non- 
transcribed sequence, about 300 nucleotides of 
3' untranscribed sequence and a single, very 
large intron of 5495 nucleotides (Haley et al., 
1987), is only 36% GC, and this figure rises to 
only 40% for the relaxin cDNA. 

Variation in nucleotide composition can 
have more subtle manifestations than simple 
variation in nucleotide content. The cytosine of 
the dinucleotide CpG is normally methylated 
throughout vertebrate genomes as part of a 
transcription suppression mechanism, except 
in clusters of this dinucleotide forming so-called 
CpG islands, which somehow escape methyla¬ 
tion and allow expression of the adjacent gene. 
The methylated CpG is particularly prone to 
mutation to TpG. In all vertebrates examined, 
the CpG dinucleotide is about five times less 
frequent than would be predicted on base com¬ 
position, because of this mutational pressure. 


Number of Expressed Genes 

in the Pig 

When the first edition of this chapter was writ¬ 
ten, no estimate had been made of gene 
number in the pig, nor indeed had any accu¬ 
rate count of gene number been made for any 
species. The generally accepted figure for ver¬ 
tebrates at that time was about 50,000- 
100,000, regardless of variation in genome 
size. Antequera and Bird (1993), on the basis 
of there being an estimated 45,000 and 
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37,000CpG islands in the human and mouse 
genomes, respectively, and that 56% of human 
genes and 47% of mouse genes are preceded 
by CpG islands, came up with an overall esti¬ 
mate of gene number in both species of about 
80,000. Numerous other estimates of gene 
number were bandied about for humans in par¬ 
ticular, and mammals in general, going as high 
as 150,000. These inflated numbers were 
brought down to earth with a thump with the 
completion of the first human genome 
sequences, which clearly showed that the 
number of protein-encoding genes, plus the 
already recognized infrastructural RNAs, such 
as rRNAs and tRNAs, was about 20,000. This 

number was subsequently verified for all other 
mammals for which complete sequences 
became available. The current best, but cer¬ 
tainly downwardly biased, estimate of gene 
number in the pig is 18,621 (Seemann et al. 
2007) and is based on the content of the 
PigEST resource database in 2007. It is to be 
expected that this will drift slightly upwards 
towards the generally accepted value for all 
mammals as the final elements of the porcine 
genome sequence come to light. 

Ironically, the more recent recognition 
that almost the entire genome is transcribed, 
and that there are up to 34,000 non-coding 
but quite possibly functional RNAs in mam¬ 
mals, means that both the concept of gene 
number and the actual gene count require seri¬ 
ous rethinking. Crudely, one might wish to 
simply define the ncRNAs as originating from 
genes and add their number to the count for 
protein-coding sequences, giving an overall 
figure of greater than 50,000. However, this is 
a gross oversimplification, ignoring the com¬ 
plexities of the interleaved model of genome 
structure. What do you count as a gene, for 
example, when a mRNA is transcribed from 
one strand of DNA and a miRNA is transcribed 
from the other and regulates expression of the 
protein encoded by that mRNA? 


Mitochondrial Genome 

Mitochondria are cellular organelles believed to 
be derived from ancient intracellular bacterial 
symbionts. In addition to their unique role in 


synthesis of ATP (oxidative phosphorylation), 
these organelles have a very peculiar genetic 
system. They have an autonomously replicat¬ 
ing DNA genome, of about 16.5 kb in mam¬ 
mals, but the mechanism of replication is quite 
unlike that for nuclear DNA. Replication 
commences with initiation of heavy (H)-strand 
synthesis at a specific origin and results in the 
formation of a displacement (D) loop with a 
newly synthesized H strand of about 680 bases, 
with very few H strands achieving full length. 
Initiation of light (L)-strand synthesis occurs 
from a specific origin but only after this region 
has been exposed by H-strand synthesis. 
Transcription is initiated from promoters situ¬ 
ated within the D-loop region, with the primary 
transcripts processed to give 12S and 16S 
rRNAs, tRNAs and a number of mRNAs which 
are not capped but are polyadenylated. Virtually 
all of the DNA in the mitochondrial genome is 
functional, specifying rRNA, tRNA or mRNA, 
with very few, and in some cases no, non-coding 
nucleotides between adjacent genes. The 
D loop does not encode any known function. 

Most proteins in the mitochondria are 
nuclear encoded but some are mitochondrially 
encoded and translated by a separate system in 
which the rRNAs and tRNAs are also mito¬ 
chondrially encoded. In the mammalian mito¬ 
chondria, proteins are synthesized using a 
unique mitochondrial code, which employs dif¬ 
ferent initiation and stop codons from the uni¬ 
versal code, and messages are decoded by a 
different mechanism from that used by nuclear 
genes. This decoding mechanism uses a mini¬ 
mal set of only 22 tRNAs with different wobble 
anticodons from those employed in nuclear 
protein synthesis. Formyl-methionine is used to 
initiate a peptide, as in prokaryotes. However, 
mammalian mitochondrial rRNA has only low 
homology with prokaryotic rRNA. 

The complete mitochondrial genome 
sequence has been available for several mam¬ 
malian species, including human (Anderson 
et a/., 1981), cattle (Anderson et al., 1982), 
mouse (Bibb et al., 1981) and rat (Gadaleta 
et al., 1989), for many years. The first com¬ 
plete mitochondrial sequences for pigs became 
available in 1998 (Ursing and Arnason, 1998; 
Lin et al., 1999; Kijas and Andersson, 2001). 
Earlier mitochondrial sequence analyses for 
the pig were restricted to the cytochrome 
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b gene (Irwin et a/., 1991; Honeycutt et a/., 
1995) and the cytochrome c oxidase II gene 
(Honeycutt et a/., 1995), where the porcine 
sequences were generated as part of a large 
phylogenetic study. Additionally, the D loop 
region (Ghivizzani et ah, 1993) of the pig has 
been extensively characterized. The porcine 
D loop is much larger than in the cow and, 
surprisingly, displays greater organizational 
similarity to the mouse and human D loop 
than to the bovine D loop. Within the porcine 
D loop, there is a ten-nucleotide repeat, with 
copy number ranging from 14 to 29, giving 
rise to the highest level of mitochondrial 
heteroplasmy documented in mammals, 
apparently as a result of replication slippage 
in the tandem repeat. Mitochondrial sequences 
have been used extensively for phylogenetic 
studies of Sus scrofa and its relatives (Giuffra 
et ah, 2000; Kijas and Andersson, 2001; 
Okumura et ah , 2001; Kim et ah, 2002; 
Gongora et ah , 2004; Larson et ah , 2005; 
Lucchini et ah , 2005), revealing among other 
things the separate domestication of pigs in 
Europe and Asia. 

Nuclear mitochondrial sequences (numts) 
comprise an interesting class of pseudogene. In 
humans, there is an unusually large number, 
estimated at 350-612 copies comprising about 
0.012% of the genome, which have been trans¬ 
ferred into the nuclear genome (www. 
pseudogene.net). Domestic cats also have large 
numbers of numts, between 38 and 76. 
Undetected numts can compromise phyloge¬ 
netic analyses of mitochondrial DNA (mtDNA) 
sequences as their evolutionary history will be 
quite distinct from the conserved functional 
mtDNA, although phylogenetic signals can be 
obtained from the presence or absence of numts 
in some lineages. Numts have not been reported 
in S. scrofa , although Funk et al. (2007) found 
evidence for a cytochrome B numt in the pygmy 
hog, Sus (Porcula) saluanius. 


Conclusions 

This review of the molecular genetics of the 
pig has aimed to cover studies specific to the 
pig, but to put this information in the broader 
context of the genomic structure and function 


common to all mammals. It is very clear that 
the pig, like all mammals, has only a small 
proportion of its genome of around 2% 
dedicated to encoding proteins. Formerly, 
much if not most of the remainder was often 
dismissed as junk DNA with no function. With 
the recognition of genome-wide transcription 
and large numbers of transcriptionally regu¬ 
lated and apparently functional non-protein¬ 
coding RNAs, a more sophisticated view of 
genome structure, gene number and func¬ 
tional genome content is beginning to 
develop, which increases the challenge of 
functionally cataloguing the entire genome of 
the pig. We received small relief in this 
Herculean task with the recognition that the 
number of protein-coding genes is much 
smaller than originally expected in the pig 
and other mammals, but this was largely 
taken back with the recognition of the huge 
number of ncRNA genes, most of which 
remain functionally uncharacterized. 

The protein-coding component of the 
genome of the pig is already almost completely 
characterized, and this task will soon be com¬ 
plete. The recognition of allelic variants (SNPs) 
is well under way. The molecular tools for 
understanding and exploiting genetic variation 
in economically important traits in pigs are 
largely in place, but we still have a long way to 
go in understanding the control of genetic vari¬ 
ation in complex traits. 
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Introduction 

The immune system protects swine against 
infection and coordinates immune responses 
to an unknown array of foreign antigens, vac¬ 
cines and pathogens. Any meaningful attempt 
to correlate the classical genetics of disease 
resistance of swine with differences in immune 
system genetics must consider the genetic ori- 

©CAB International 2011. The Genetics of the Pig, 
2nd Edn (eds M.F. Rothschild and A. Ruvinsky) 


gin of innate and adaptive immune receptors 
(see reviews in Summerfield, 2009). The 
immune system adds an additional level of 
complexity to understanding genetics as a 
result of the differentiation of lymphocytes, 
i.e. the T cells and B cells, and their receptors. 
Unlike other somatic cells, lymphocytes utilize 
a combination of genetic mechanisms, 
including recombination-activating gene (RAG) 
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protein-mediated somatic recombination, 
gene segment rearrangement, junctional 
diversity and somatic mutation mechanisms 
to generate a vast repertoire of T cell recep¬ 
tor (TCR) and B cell receptor (BCR) pheno¬ 
types with which to sample the antigenic 
environment (Table 6.1) (Paul, 2008; Barreiro 
and Quintana-Murci, 2010). Immunoglobulin 
(Ig) serves as the surface antigen receptor for 
B cells, i.e. the BCR. Recognition of antigen 
is through the variable regions of both the 
heavy and light Ig polypeptide chains that 
comprise the BCR. Without the use of a 
combination of genetic recombination mech¬ 
anisms, the TCR and BCR repertoire (>10 10 
for each) would require more DNA than is 
available in the entire typical mammalian 
genome (Venturi et a/., 2008). Immune 
receptors like BCRs and TCRs provide an 
additional genetic challenge; they are the 
result of somatic hypermutation (SHM) that 
is regulated by genes thought to follow 
Mendelian genetics like most genes that con¬ 
trol innate immune receptors. 

An equally important genetic element is 
the major histocompatibility complex (MHC), 
which is also highly complex, not due to 
somatic recombination and mutation events, 
but rather resulting from the large number of 
allelic variants at many of the >300 loci in 
the complex (Horton et a/., 2004; Kelley 
et a/., 2005; Paul, 2008; Marsh et a/., 
2010). The human leucocyte antigen (HLA) 
is recognized as the most variable region in 
the human genome (Horton et a/., 2008). 
The MHC antigens, in swine the ‘swine leu¬ 
cocyte antigens’ (SLAs), are vital to the over¬ 
all immune response because of their role in 
binding and presentation of foreign antigens 


to the TCR. The allelic variants at these 
loci serve to increase the range of foreign 
antigens that can be recognized and pre¬ 
sented to the TCR. The major class I MHC 
antigens are expressed on all cells, whereas 
the class II MHC antigens are primarily 
expressed on the surface of so-called profes¬ 
sional antigen-presenting cells (APCs), such 
as B cells, macrophages and follicular den¬ 
dritic cells (Schook and Lamont, 1996; 
Lunney et a/., 2009). Other loci in this region 
serve to process and transport foreign anti¬ 
gens to the MHC molecule for effective pres¬ 
entation to the TCR. Additional ‘non-classical’ 
MHC molecules, such as HLA-G (human leu¬ 
cocyte antigen-G), present ligands to natural 
killer immunoglobulin receptors (KIRs), which 
themselves are very polymorphic receptors deter¬ 
mining innate immune responses (Table 6.1) 
(Middleton and Gonzelez, 2010). The single 
pig KIR gene, KIR2DL1 , maps to SSC6q 
where it is linked by LILRA , LILRB and 
LILRC, members of the leucocyte immuno¬ 
globulin-like receptor (LILR) family, innate 
immune receptors with immunomodulatory 
functions (Sambrook et a/., 2006). 

It is sometimes difficult for young animal 
scientists to understand differences between 
the genetics of adaptive immune receptors and 
those controlling simple Mendelian traits. The 
genetic complexity provided by loci encoding 
the MHC, TCR or BCR is the basis for the 
subtitle 'the Science of Self-Non-self 
Discrimination ’ adopted by Jan Klein in his 
classic immunology textbook (Klein, 1982). 
This chapter will focus on the genetics of the 
porcine MHC, TCR and BCR (Table 6.1), as 
well as the many genetic loci that regulate 
innate and adaptive immune responses. 


Table 6.1. The genetic diversity of the antigen receptors of the adaptive immune system. 


Receptor 

Origin of specificity 

Origin of genetic diversity 

Germ line 

Somatic 

Rearrangement 

SHM 

Polygenic 

B cell receptor (BCR) 

— 

+ 

+ 

+ 

+ 

T cell receptor (TCR) 

— 

+ 

+ 

— 

+ 

Major histocompatibility 

+ 

— 

— 

— 

+ 

complex (MHC) 1 






MHC II 

+ 

— 

— 

— 

+ 

Non-classical MHC 

+ 

— 

— 

— 

± 
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Overview of the Major Antigen 
Receptors of the Immune System 

Innate versus adaptive immunity, 
essential genetic components 

To understand immunogenetics it is first neces¬ 
sary to understand the difference between the 
immediate, or innate, immune response and 
the delayed acquired, or adaptive, immune 
response (Paul, 2008; Barreiro and Quintana- 
Murci, 2010). Innate responses exhibit broad 
antigen specificity and have fixed receptors 
that are the first line of defence against bacterial 
and viral infections. These receptors recognize 
ligands, such as bacterial lipopolysaccharide 
(LPS), bacterial nucleic acids and bacterial flagellin 
DNA, that are collectively called pathogen- 
associated molecular patterns (PAMPs). Many 
have leucine-rich repeat (LRR) domains, 
which are indispensable for recognition of 
PAMPs. Most of these innate receptors are not 
lymphocyte restricted but are found on many 
cell types, and especially on APCs. The adap¬ 
tive response is exquisitely antigen specific; the 
TCRs and Igs are encoded in the germ line; but 
antigen-dependent responses require gene 
rearrangements and result in the development 
of positive memory with enhanced recall 
responses (Lunney et a/., 2004; Paul, 2008). 
Innate immunity is regulated by effector pro¬ 
teins such as antimicrobial peptides, acute- 
phase proteins, complement, cytokines and 
chemokines expressed by monocytes, macro¬ 
phages, granulocytes, neutrophils and natural 
killer (NK) cells. Acquired immunity is regulated 
for each cell (T cell, B cell, APC, dendritic 
cell (DC)) by cytokines and chemokines, Igs, 
perforins and granzymes, and regulatory cell 
subset interactions (Ezquerra et a/., 2009; 
Gerner et a/., 2009; Summerfield and 

McCullough, 2009). 


Genetic diversity of innate 
immune receptors 

Based on comparative studies, several families 
of innate immune receptors were identified, 
e.g. Toll-like receptors (TLRs), first found as 
developmental^ important for Drosophila 


(Lemaitre et a/., 1996; Medzhitov and 

Janeway, 2000; Takeda et a/., 2003). There 
are ten or more TLR molecules that recognize 
complex patterns of molecules derived mainly 
from microbes; genes corresponding to all ten 
TLRs have been cloned in pigs (Table 6.2) 
(Shinkai et a/., 2006a; Zhu et a/., 2008; 
Uenishi and Shinkai, 2009). Many TLRs 
(TLR1, TLR2, TLR4, TLR5, TLR6, TLR 10) 

are localized on the cell surface and recognize 
compounds derived mainly from microbes: 
TLR2 senses Gram-positive bacteria through 
molecules such as peptidoglycans, lipoteichoic 
acid and zymosan from yeasts or fungi; TLR4 
monitors Gram-negative bacteria by recogniz¬ 
ing LPS; TLR5 recognizes microbial flagellin. 
Most TLR molecules form homodimers to 
transmit signals; however, TLR2 constructs a 
heterodimer with TLR6 or TLR1 to bind di- 
and tri-acylglycerides, respectively. The TLRs 
that are expressed on the membranes of intra¬ 
cellular organelles such as endosomes recog¬ 
nize nucleic acids or derivatives of nucleotides 
and are thought to be involved in antiviral 
defence. TLR3 recognizes double-stranded (ds) 
RNA; TLR7 and TLR8 recognize single- 
stranded RNA (ssRNA); TLR9 is a receptor for 
unmethylated CpG DNA, which is generally 
found in bacterial or viral genomes. No tran¬ 
script corresponding to murine TLR11, TLR 12 
or TLR 13 has yet been reported. A thorough 
review of porcine TLRs has recently been pub¬ 
lished by Uenishi and Shinkai (2009). 

Shinkai et al. (2006a,b) reported single 
nucleotide polymorphisms (SNPs) in genes for 
TLR1, TLR2, TLR4, TLR5 and TLR6 by 
screening samples from 11 different pig meat 
breeds. They detected 136 SNPs in these five 
porcine TLR genes, which included 63 non- 
synonymous SNPs, i.e. SNPs causing amino 
acid substitutions. Bergman et al. (2010) 
reported that the numbers of SNPs detected in 
TLR1 and TLR2 were significantly lower in 
the wild boars than in the domestic pigs. The 
authors suggest that this may be because of a 
bottleneck before domestication in the European 
wild boar population, resulting in less diversity in 
European versus Asian pigs in both wild and 
domestic populations (Larson et a/., 2005). For 
TLR1 , TLR2 and TLR6 genes, SNPs were 
disproportionately located in the sequences 
encoding TLR ectodomains; non-synonymous 
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SNPs were more likely to be heterozygous than 
SNPs in other regions (Shinkai et a/., 2006b). 
For TLR1, TLR4 and TLR5 in pigs, cattle and 
humans, SNPs affecting the coding amino acids 
are concentrated in the sequences encoding 
ectodomains, particularly the regions encoding 
LRRs (Uenishi and Shinkai, 2009). 
Polymorphisms in TLR3, TLR7 and TLR8 were 
observed particularly in their ectodomains 
(Morozumi and Uenishi, 2009). Comparative 
TLR studies have predicted disease effects (Jann 
et a/., 2009). Numerous infections alter the 
expression of TLRs, e.g. porcine reproductive 
and respiratory syndrome virus (PRRSV) infec¬ 
tion alters mRNA expression of TLR2, TLR3 , 
TLR4, TLR7 and TLR8 in at least one lymph¬ 
oid tissue or cells (Liu et al., 2009). Examples of 
TLR involvement in pig pathogen associated 
responses are given in Table 6.2. 


The NLR (nucleotide-binding domain, 
LRR-containing) proteins are central regulators 
of immunity and inflammation; several NLRs 
activate the inflammasome complex and result 
in inflammatory cytokine expression, regulation 
of NFkB (nuclear factor kappaB) activation, etc. 
(Ting et al, 2010). Jozaki et al. (2009) investi¬ 
gated the functional variance of missense 
polymorphisms in ligand recognition by the 
porcine NLR family gene, NOD2. The T1949C 
polymorphism, located in the region encoding 
the hinge domain of the molecule, diminished 
the functional response of porcine NOD2 to 
muramyldipeptide (MDP), whereas the A2197C 
polymorphism, in the LRR region, significantly 
augmented the response of porcine NOD2 to 
the ligand (Table 6.2). Jozaki et al. (2009) 
noted that the 1949C allele was rare among 
pig breeds, suggesting that it is a disadvantage 


Table 6.2. Pathogen responses associated with TLRs (Toll-like receptors) and NLRs (nucleotide-binding 
domain, LRR (leucine-rich repeat)-containing proteins) in pigs. Adapted from Uenishi and Shinkai (2009). 


Gene 

Map location 3 (SSC: 
position in Mb) 

Effect on immune response 

Reference 

TLR2 

8: 30.3 

Increased expression of TLR2 in the 
intestinal tissues of gnotobiotic pigs after 
treatment with Escherichia coli, 
Lactobacillus fermentum 

Willing and Van 
Kessel, 2007 

TLR2, TLR6 

8: 83.6 

8: 30.3 

Recognition of Mycoplasma hyopneumo- 
niae cell bodies in porcine alveolar 
macrophages 

Muneta et al., 2003 

TLR3 

15:58.9 

Activation of TLR3 - increased expression 
of IFN (interferon) a, decreased PRRSV 
(porcine reproductive and respiratory 
syndrome virus) infection of 
macrophages 

Miller et al., 2009 

TLR5, TLR9 

10: 14.9 

13:39.7 

Upregulation after feeding of Salmonella 
enteritica enteritica serovarTyphimurium 
or S. e. e. serovar Choleraesuis 

Burkey et al., 2007 

TLR7 

X: 9.3 

Enhanced transcriptional activation of 
TLR-7-induced genes in swine 
macrophages 

Fernandez-Sainz 
et al., 2010 

TLR7, TLR8 

X: 9.3 

X: 9.4 

NK (natural killer) cells activated by TLR 
agonists cytotoxic against foot-and- 
mouth disease virus-infected cells 

Toka et al., 2009 

TLR9 

13:39.7 

Porcine circovirus 2 infection, CpG-TLR9 
signalling cytokine inducers and 
inhibitors 

Hasslung et al., 2003; 
Vincent et al., 

2007; Wikstrom 
et al., 2007 

NODI 

18:40.8 

Immunobiotic lactic acid bacteria 

Tohno etal., 2008a,b; 

NOD2 

NR b 

MDP (muramyldipeptide) response 

Jozaki et al. 2009 


a Jann et al., 2009. 
b NR = not reported. 
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to pig responses to microbes; in contrast, the 
2197C allele was widely distributed among 
Western breeds, thus suggesting a causal rela¬ 
tionship between molecular function and poly¬ 
morphisms in pattern-recognition receptors 
(FRRs). The authors concluded that the 2197C 
allele would confer an advantage in modern pig 
breeds and serve as a useful marker of improved 
disease resistance. 

The structure of other innate receptors is 
diverse (Fritz et a/., 2006; Pancer and Cooper 
2006). Innate immune receptors are often ‘adju¬ 
vant receptors’ that control the activities of 
accessory cells of the immune system and thus 
allow adaptive immune receptors to function 
(McKee et a/., 2010). Among the best known 
are products of mycobacteria used in Freund’s 
adjuvant and synthetic deoxyoligonucleotides 
(ODNs), e.g. CpG-ODN. For example, newborn 
piglets are unable to mount adaptive immune 
responses unless they first encounter PAMPs 
provided by colonizing bacteria or bacterial 
DNA because CpG-ODN is recognized by 
innate immune receptors such as TLR9 (Butler 
et a/., 2002, 2005a). Uenishi and Shinkai 
(2009) predicted the use of TLR ligands as 
immune modulators, particularly for T and B 
cell responses. The high frequency of SNP 
variations among TLR alleles will affect ligand 
recognition and thus lead to differences in path¬ 
ogen responses among pig populations. 

Overall, the receptors for the innate immune 
system are largely encoded in the germ line; thus 
under classical Mendelian genetic inheritance. In 
any case, whether an immune receptor is innate 
or adaptive, its function depends on expression 
and expression depends on a complicated series 
of germ line encoded regulatory genes. So func¬ 
tional swine genetics will be reflected by a sym¬ 
phony of genetic influences, some somatic, 
some rigidly germ line, and all of which are 
under the control of regulatory germ line genes. 


Germ line versus somatic generation of 
B andT cell receptors (BCRs andTCRs) 

TCRs and BCRs are not encoded in the germ 
line but are the result of somatic gene rearrange¬ 
ment and, in the case of BCRs, also of somatic 
hypermutation (SHM), which is mediated by 


activation-induced cytidine deaminase (AID) 
(Venturi et a/., 2008; Thomas et a/., 2009). 
The somatic processes that control BCR and 
TCR generation are controlled by nuclear 
enzymes such as recombination-activating 
enzyme (coded by RAG), terminal deoxynucleo- 
tidyl transferase (Tdt), AID and various constitu¬ 
tive DNA repair enzymes. Except for certain 
constant region genes, receptor formation and 
the expression of BCRs and TCRs depend pri¬ 
marily on regulatory genes, not on germ line 
genes. Thus, one may predict that only the reg¬ 
ulator genes controlling the somatic process 
ascribe to classical Mendelian genetics. However, 
such a simplistic demarcation is itself an over¬ 
simplification (Flajnik and Du Pasquier, 2004). 
For example, the BCRs of fetal B cells that lead 
to ‘natural antibodies’, and the invariant y/5 
TCRs that dominate fetal life might be consid¬ 
ered innate receptors. It seems better that the 
distinction should be based on function, not 
structure. 

In addition to being exclusively germ line 
encoded, TLRs and other families of innate 
immune receptors, as well as MHC antigens, 
are controlled by codominant genes, and both 
alleles can be expressed on any one cell. This 
sharply differs from TCRs and BCRs expressed 
on lymphocytes, for which gene expression is 
controlled by allelic exclusion, i.e. only the 
gene products of one allele are expressed on 
any one T cell or B cell; thus, lymphocytes are 
monospecific. This evolutionary development 
is easy to comprehend because a B or T cell 
with dual receptors might recognize both a 
dangerous pathogen and one’s own self¬ 
antigen. 


The Porcine MHC or Swine Leucocyte 
Antigen (SLA) Complex 

Organization of the SLA complex genes 

The swine MHC, the SLA complex, is one of 
the most gene-dense regions in the swine 
genome. It consists of three major gene clus¬ 
ters, the SLA class I, class III and class II 
regions, that span —1.1, 0.7 and 0.5Mb, 
respectively, on chromosome 7 (SSC7pl.l- 
ql.l), making the swine MHC the smallest 
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among mammalian MHC so far examined, 
and the only one known to span the centro¬ 
mere (Fig. 6.1) (Chardon et a/., 2000; Lunney 
et a/., 2009). The entire SLA region of the 
very common Hp-1.1 (H01) haplotype has 
been mapped and sequenced (Velten et a/., 
1998; Chardon et a/., 2001; Renard et a/., 
2001, 2003, 2006; Shigenari et a/., 2004); a 


total of >230 loci have been identified, with at 
least 190 genes predicted to be functional 
(VEGA Pig, 2010). 

The MHC genes are critical for generation 
of most adaptive immune responses. Initiation 
of adaptive immune responses to most protein 
antigens, the thymus-dependent (TD) antigens, 
requires dual recognition. The BCR recognizes 
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Fig. 6.1. Comparative genomic organization of the human and swine major histocompatibility complex 
(MHC) class I and class II regions. The human leucocyte antigen (HLA) class I and II map is adapted 
from Horton et al. (2004) and the swine leucocyte antigen (SLA) class I and II map is based only on one 
fully sequenced haplotype (Hp-1.1, H01) (Renard et al., 2006). Note: not all the genes are shown here 
and the scale is approximate. The number and location of expressed SLA class I or II genes may vary 
between haplotypes. Adapted from Lunney et al. (2009). 
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the intact foreign antigen, but the full-blown B 
cell response that results in secretion of antibod¬ 
ies requires a second signal delivered from a 
TCR-dependent T cell response. The TD anti¬ 
gen is processed into peptides that are pre¬ 
sented to T cells by MHC class II antigens. The 
MHC class I antigens bind eight to ten amino 
acid peptides, generated by intracellular break¬ 
down of self and foreign proteins, and stimulate 
adaptive cytotoxic (CD8+ T cell) immune 
responses when they present these peptides to 
the TCR (Yaneva et a/., 2010). The het- 
erodimeric proteins (SLA class I and (3 2 - 
microglobulin) are constitutively expressed on 
the surface of virtually all nucleated cells and 
function mainly in presenting peptides to CD8+ 
cytotoxic T cells. They also interact with NK 
cells to prevent NK-mediated cytotoxicity 
(Kwiatkowski et a/., 1999). Non-classical MHC 

I receptors are less or non-polymorphic recep¬ 
tors and recognize other chemistries, including 
glycolipids derived from bacteria. The SLA class 

II antigens are expressed mainly on APCs and 
function in presenting exogenous peptides to 
CD4+ helper T cells (Ezquerra et a/., 2009). 

SLA class I and II genes 

The SLA map includes seven classical class I 
genes and three non-classical class I genes 
pig. 6. la). The overall genomic organization of the 
SLA class I region is quite different from that of 
the HLA class I region, yet they share common 
orthologous segments with strong conserva¬ 
tion in the order and organization of the genes. 
Most differences appear to be due to segments 
where HLA expansion has occurred (Renard 
et a/., 2003; Kelley et a/., 2005). The consti¬ 
tutively expressed classical SLA class I genes 
are SLA-1, SLA-2 and SLA-3 , while the rest 
are pseudogenes. Increasing evidence also 
suggests that some SLA haplotypes have a 
duplicated SLA-1 locus, designated SLA-1' 
(Smith et a/., 2005a; Ho et a/., 2009a). 
Because the SLA class I genes are encoded by 
three loci that means that potentially six differ¬ 
ent genes can be expressed in a heterozygous 
individual. Indeed, Tanaka-Matsuda et al. 
(2009) sequenced the genomic region carrying 
SLA classical class I genes of an SLA I hetero¬ 
zygous pig (Hp-28.0/Hp-62.0) and reported 


up to six classical class I genes in Hp-28.0, 
including an additional SLA-1 (-like) gene, 
newly designated SLA-12. Thus the SLA com¬ 
plex has copy number variance for the classical 
class I loci; this varies depending on SLA hap- 
lotype (Hp) (Tanaka-Matsuda et a/., 2009). 
The non-classical class I genes, SLA-6, SLA-7 
and SLA-8 , are located at the centromeric end 
of the class I region. The SLA class I region 
also harbours the MHC class I chain-related 
genes (MIC); MIC2 is predicted to be functional 
while the MIC1 gene appears to be a pseudo¬ 
gene (Fig. 6.1a) (VEGFA Fig, 2010). 

Phylogenetic analyses showed that the 
SLA class I genes displayed much more sequence 
homology to each other than to the HLA class 

I genes (Renard et a/., 2003; Smith et a/., 
2005a; Ho et a/., 2009a,b). SLA class I genes 
were named with numbers to avoid the implica¬ 
tions that any of these loci were more homolo¬ 
gous to the HLA-A, HLA-B or HLA-C genes of 
the HLA system. Sequence comparisons indi¬ 
cated that the SLA-1 and SLA-3 genes are 
more similar to each other, as is the SLA-1', 
than they are to SLA-2. Therefore, it is likely 
that these arose as gene duplications after the 
separation of primates and Artiodactyla. 

The loci encoding the expressed SLA class 

II antigens include the a- and (3-chain genes for 
the SLA-DR, SLA-DQ, SLA-DM and SLA-DO 
proteins; genes are linked in tandem on SSC7 
(Fig. 6.1a) (Renard et a/., 2003; Smith et al., 
2005a; Ho et al., 2009a). From the most cen¬ 
tromeric SLA-DR A gene in the class II gene 
cluster, the order of the expressed SLA genes is 
DRB1, DQA, DQB1, DOB1, DMB, DMA and 
DOA (Fig. 6.1b). In contrast to the HLA system, 
there are no loci encoding the DP proteins. 
There are several class II (3-chain pseudogenes in 
the SLA class II region, including DRB2, DRB3, 
DRB4, DRB5, DQB2, DOB2 and wDYB. The 
expressed SLA class II antigens are designated 
as SLA-DR and SLA-DQ based on their homol¬ 
ogy to the human molecules. They are expressed 
primarily on the surface of professional APCs, 
such as macrophages, B cells and dendritic cells, 
as well as capillary endothelia (Wilson et al., 
1996; Chamorro et al., 2000; Summer field 
etal., 2003; Ezquerra etai, 2009; Summerfield, 
2009). Unexpectedly, normal swine T cells 
express SLA class II antigens, with preferential 
expression on the CD 8+ T cell subset (Pescovitz 
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et al., 1985; Saalmiiller et a/., 1991; Saalmiiller 
and Maurer, 1994). SLA class II antigens func¬ 
tion mainly in presenting exogenous 10-18 
amino acid peptides to CD4+ helper T cells. 

Genes involved in the antigen presentation 
pathway, such as the transporter associated 
with antigen processing (TAP) genes ( TAPI 
and TAP2) and proteasomes (PSMB8 and 
PSMB9), are also located in the class II region 
between the DOB1 and DMB loci. The overall 
genomic organization of SLA class II region is 
well conserved with HLA, but the length of the 
SLA class II region is much shorter. Phylogenetic 
analyses also showed that the SLA class II genes 
demonstrated much stronger sequence homol¬ 
ogy with their HLA counterparts than they do 
with each other (Smith et a/., 2005b). As a 
result, the functional SLA class II genes were 
named after their human counterparts to indi¬ 
cate the orthology between the two systems. 

The SLA class III region is centromeric 
and physically linked to the contiguous class 
I region. Over 60 loci have been characterized 
in this region, including many important genes 
in the immune defence mechanism, such as 
the tumour necrosis factor (TNF) gene families 
(TNF, LTA and LTB), the steroid cytochrome 
P450 21-hydroxylase (CYP21) enzyme and 
components of the complement cascade (C2, 
C4A and CFB) (Brule et a/., 1996; Peelman 
et a/., 1996; Renard et al., 2006). 


SLA gene and haplotype polymorphisms 

One of the most remarkable features of the 
MHC genes is the extremely high degree of 
genetic polymorphism within loci. The human 
MHC Haplotype Project affirmed that they are 
the most polymorphic genes in the vertebrate 
genomes, with 300 total loci, including 122 
gene loci with coding substitutions, of which 97 
were non-synonymous (Horton et a/., 2008). 
The Immuno Polymorphism Database-MHC 
(IPD-MHC) web site (http://www.ebi.ac.uk/ 
ipd/mhc/sla/) serves as the repository for main¬ 
taining a list of all recognized SLA genes and 
their allelic sequences. The Nomenclature 
Committee for Factors of the SLA System 
regularly updates SLA nomenclature and desig¬ 
nates alleles of each locus into groups based on 


sequence similarity (identification of ‘group- 
specific’ polymorphic sequence motifs) (Smith 
et al ., 2005a; Ho et al., 2009a). The allelic group 
assignments were based primarily on polymor¬ 
phisms in the exon 2 and 3 sequences for class 

1 alleles and in the exon 2 sequences for class II 
alleles, given that these regions encode the pep- 
tide-binding domains, as well as interacting 
directly with the immune cell receptors, and are 
therefore considered to be functionally vital (Ho 
et al., 2009b, 2010). Given the strong linkage 
disequilibrium exhibited for SLA loci, haplotypes 
have been designated for specific combinations 
of alleles of class I and II genes; these were first 
defined by high-resolution DNA sequencing 
(Smith et al., 2005a; Ho et al., 2009a). These 
high-resolution SLA haplotypes are named with 
a prefix ‘Hp-’, and a number for the class I hap¬ 
lotype followed by a number for the class II hap¬ 
lotype separated by a period (e.g. Hp-1.1). 

Based on the IPD-MHC SLA database, a 
total of 116 SLA classical class I alleles and 13 
non-classical class I alleles have been identified to 
date; the SLA-1, SLA-3 and SLA-2 genes are 
highly polymorphic (Smith et al., 2005b; Ho 
et al., 2009b). There are 12 SLA-1 allele groups 
with a total of 44 alleles; seven SLA-3 allele 
groups with 26 alleles, and 14 SLA-2 allele 
groups with 46 alleles. The extreme polymorph¬ 
isms of the SLA class I genes are, as expected, 
concentrated in exons 2 and 3 of the coding 
regions which form the class I protein peptide¬ 
binding groove. The non-classical SLA-6 gene 
appears to be largely monomorphic. 

There are a total of 167 SLA class II alleles 
identified to date (128 (3-chain, 39 a-chain alle¬ 
les), with polymorphisms mainly located in exon 

2 of the coding sequences (Smith et al., 2005b; 
Ho et al., 2009b; IPD-MHC web site). The 
SLA-DRB1 and SLA-DQB1 loci display a very 
high degree of polymorphism. There are 14 
DRB1 allele groups with a total of 82 alleles, 
and nine DQB1 allele groups with 44 alleles. 
The SLA-DQA locus exhibits a moderate degree 
of polymorphism, with 20 alleles and five allele 
groups identified to date. As with HLA-DRA, 
the SLA-DRA locus exhibits a very limited poly¬ 
morphism, with 13 alleles representing three 
allele groups, despite the fact that it also encodes 
part of the domain for binding antigenic pep¬ 
tides. Ando et al. (2001) characterized the DNA 
sequence of five SLA-DMA alleles, which 
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showed only a few nucleotide substitutions in 
exon 3 and exon 4 of their coding regions. 

Originally, SLA haplotypes were serologi¬ 
cally defined; a total of 72 SLA class I haplo¬ 
types (designated H01-H72) were reported, 
reflecting mostly European commercial pig 
breeds (Renard et ai, 1988; Chardon et a/., 
2000). To date, a total of 29 SLA class I haplo¬ 
types and 21 SLA class II haplotypes have been 
defined by means of high-resolution DNA 
sequencing. With FCR-sequence-specific primer 
(SSP)-based SLA typing methods, a total of 49 
class I and 30 class II SLA haplotypes have so 
far been identified after testing nearly 850 pigs 
obtained from multiple commercial sources (Ho 
et ai, 2009b, 2010). This apparently low SLA 
diversity in commercial pigs may be the result of 
selection and resultant selective breeding for 
desirable production traits. 

Based on earlier data, there is substantial 
linkage disequilibrium, the non-random associa¬ 
tion of alleles at adjacent loci, for the SLA com¬ 
plex genes; the overall recombination frequencies 
were reported to be 0.4-1.2% within the SLA 
region and 0.05% within the class I region 
(Vaiman et a/., 1978; Pennington et a/., 1981; 
Edfors-Lilja et a/., 1993; Smith et ai, 1995; 
Chardon et ai, 2000). Most previously docu¬ 
mented crossovers mapped to the SLA class III 
region. However, use of PCR-SSP has resulted in 
the identification of additional recombinations 
corresponding to crossover frequencies of 0.56% 
between the class I and class II regions, and 
0.39% within the class I region, reflecting better 
detection methods (Ho et ai, 2009a,b, 2010). 


Role of SLA genes in immunity 
and swine health 

Several authors have reviewed the potential of 
using genetic approaches to improve animal 
disease resistance (Mallard et ai, 1991; Stear 
et ai, 2001; Rothschild, 2003; Gibson and 
Bishop, 2005; Lewis et ai, 2007; Rothschild 
et ai, 2007; Lunney, 2010). Mallard, Wilkie 
and their colleagues have performed a series of 
experiments to establish populations of pigs 
that they predicted would be more immunologi- 
cally active and thus more resistant to infectious 
diseases; however, the high-immune-response 


pigs were more susceptible to Mycoplasma 
hyorhinis infection (Mallard et ai, 1989a; 
Magnusson et ai, 1998; Wilkie and Mallard, 
1999). Edfors-Lilja et ai (1998, 2000) were the 
first to trace quantitative trait loci (QTLs) regulat¬ 
ing normal immune traits, as detailed below. 
Several QTL-influencing traits, including growth, 
backfat thickness and carcass composition map 
to the SLA complex (Rattink et ai, 2000; 
Bidanel et ai, 2001; Malek et ai, 2001; Milan 
etai, 2002; Rothschild etai, 2007; Nagamine 
et ai, 2009). Overall, these results could help to 
guide breeders in selectively increasing the fre¬ 
quency of certain SLA alleles, i.e. those that are 
known to be associated with enhanced disease 
resistance or QTL effects. 

As management in the pig industry changes, 
the range of pathogens to which pigs are exposed 
is altered. Moreover, as consumers demand pork 
products free of antibiotic contamination, it becomes 
increasingly important that new heartier, health¬ 
ier breeding stock be available (Lewis et ai, 
2007; Rothschild et ai, 2007; Lunney, 2010). 
Disease-resistant pigs, in well-managed facilities, 
would help to decrease drug usage by producers 
and increase the health of the nation’s food sup¬ 
ply. Several groups have attempted to evaluate 
the relationship between the level and function of 
circulating immune cells with average daily gain, 
live and carcass measurements, feed intake and 
feed conversion (Mallard et ai, 1991; Stear 
et ai, 2001; Rothschild, 2003; Gibson and 
Bishop, 2005). Galina-Pantoja et ai (2006) 
showed that the CD16+, CD2+/CD16+, CD8+ 
and SLA-DQ+/- cell subsets appear to be impor¬ 
tant biomarkers involved with the inherent ability 
of the pig to efficiently grow and produce better 
carcass weight in representative commercial 
environments. Clapperton et ai (2009) estimated 
the heritabilities of numerous immune traits and 
measured their relationship with performance 
within both specific pathogen-free (SPF) and 
non-SPF environments. They found negative 
genetic correlations between performance and 
CD11R1+ cells, monocytes and alpha-1 acid 
glycoprotein. 

The influence of SLA-encoded genes on 
immune and disease traits is broad. Based on 
studies using SLA-defined and SLA-inbred lines 
of pigs it was affirmed that SLA genes deter¬ 
mined levels of antibody responses to protein 
and vaccine antigens (Table 6.3). Cellular 
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Table 6.3. SLA gene encoded disease and vaccine responses. 


Immune parameter SLA association Reference(s) 


Antibody response levels 
Anti-lysozyme 

Anti-model antigen 


Anti-sheep red blood cell 
Vaccination for Bordetella 
bronchiseptica 

Cellular responses 
Salmonella bacterial 
phagocytosis 
Delayed contact-type 
hypersensitivity induced 
by tuberculin protein 
Parasite antigen proliferation 
Interferon induction 
Bacterial phagocytosis 
Macrophage superoxide 
production 

Disease responses 
Melanoma initiation; 
tumour incidence 


Primary Trichinella infection 
Secondary Trichinella infection 
Primary Toxoplasma infection 
Primary PRRSV infection 


Higher Hp-14.0; lower Hp-2.0 
Higher Hp-3.3; lower Hp-4a.4 
Higher Hp-4a.4 

Lower Hp-3.3 
Higher Hp-4a.4 
Higher Hp-2.0 


Higher Hp-2.2 
Higher Hp-4a.4 


Higher Hp-3.3 
None significant 
Lower Hp-2.2 
None with class II 


Higher Hp-2.2 

QTLs map to SLA 

Lower parasite burden in Hp-3.3 
Faster anti-parasite in Hp-2.2 
None significant 

Increased viral levels with Hp 1.0; 
decreased serum neutralizing 
antibody with Hp 0.1,0.23, 0.26 


Vaiman et al., 1978 
Lunney et al., 1986 
Lunney et al., 1986; Mallard 
etal., 1989b 
Lunney et al., 1986 
Mallard et al., 1989a 
Rothschild et al., 1984; Meeker 
etal., 1987a,b 

Lumsden et al., 1993 
Wilkie and Mallard, 1999 


Lunney and Murrell, 1988 
Jordan et al., 1995 
Lacey etal., 1989 
Groves et al., 1993 


Tissot et al., 1987, 1989, 1993; 

Blangero et al., 1996 
Hruban et al., 1994; Geffrotin 
etal., 2004; Du etal., 2007 
Lunney and Murrell, 1988 
Madden etal., 1990, 1993 
Dubey et al., 1996 
J.H. Fritz et al. (2010, 
unpublished) 


responses to defined antigens showed weak 
associations with specific SLA haplotypes. 
Because of the difficulty and expense of per¬ 
forming controlled pathogen challenge studies 
in SLA-defined pigs, only limited studies have 
been performed. Lunney and colleagues have 
established that both primary and secondary 
responses to the food-borne helminth parasite, 
Trichinella spiralis , are regulated by SLA- 
associated genes, whereas no such SLA asso¬ 
ciation was found for Toxoplasma gondii 
infections (Table 6.3) (Lunney and Murrell, 
1988; Madden et al., 1990, 1993; Dubey 
et al., 1996). Recent advances in SLA typing 
enabled J.H. Fritz et al. (2010, unpublished) to 
identify SLA class I and II alleles associated with 
the control of PRRSV infections. 


This last decade has seen major progress 
in swine immunology and genetics, and particu¬ 
larly in understanding the SLA complex, its 
genetic loci and the role of SLAs in normal 
immunity and in infectious disease and vaccine 
responses. New technologies now mean that 
comprehensive probing of the role of SLA 
alleles and haplotypes should reveal specific 
antigenic epitopes that stimulate immune and 
vaccine responses. They should help to identify 
critical immune cell subsets and the exact SLA 
loci that facilitate cellular interactions for effec¬ 
tive immune responses, and unveil novel 
immune pathways regulated by SLA genes. 
Overall, the stage is set for determining the 
crucial role of SLA genes and proteins in pig 
health and productivity. 
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The Genetics of the Porcine 
Immunoglobulins (Igs) 

The Ig heavy chain ( IGH) locus isotypes 

and allotypes 

The Ig heavy chain (IGH) locus is located on 
SSC 7q26 (Chaudhary et al., 1997). Swine 
possess the same complement of heavy-chain 
isotypes as other mammals: IgM, IgD, multiple 
subclasses of IgG, and IgE and IgA. These are 
listed in Table 6.4 along with features describ¬ 
ing the gene products they encode, the known 
genetic variants, and the source of sequence 
information for each in GenBank and on the 
Comparative Immunoglobulin Workshop 
(CIgW) web site (CIgW, 2010). As in other 
mammals, IgG is the major serum Ig and IgM is 
the major BCR (Butler et a/., 2009a,c). IgD is 
transcribed in many lymphoid tissues (McAleer 
et a/., 2005), and presumably acts as a second 


BCR. As there are no antibodies specific for 
IgD available, neither its serum concentration 
nor its prevalence as a BCR is known. Fig. 
6.2a illustrates the domain structure of the 
IgM, IgD and IgG3, the three most 5' genes of 
the heavy-chain constant region locus (IG). The 
Cjil and C81 domains are nearly identical and 
IgD can be transcribed with the first domain of 
IgM (C|il). Unlike mice and humans, but simi¬ 
lar to cattle, IgD is associated with a short 
switch region (S8) and, at least in cattle, this 
appears to be functional (Zhao et al ., 2003). 
Both IgM and IgD have both transmembrane 
exons, e.g. C8ml and C8m2, and an exon 
needed for secretion (C5s; Fig. 6.2a). 

There are six expressed IgG heavy chains 
that are designated IgGl to IgG6 (Table 6.4). 
Alleles have been identified for all of them 
except IgG3. Based on their frequency of tran¬ 
scription, IgGl, IgG3 and IgG5 appear to be 
the most important (Butler and Wertz, 2006). 


Table 6.4. Porcine immunoglobulin isotypes. Modified from Butler et al. (2009b). 


Constituent polypeptides 


Isotype or 
subisotype 

Molecular 

mass 3 

S 20 ,w 

H-chain b 

H-chain c 

L-chain b 

Other* 1 

GenBank 

number 

Allotypes 

IgM 

1100 

19.0 

69-74 

63.8 

25 

16 

U50149 

1 

dlgA 

330 

9.5 

58 

51.2 

25 


U12594 

2 

slgA 

380 

11.5 

58 

51.2 

25 

70 



IgGl 

160-170 e 

6.6 

55 

49.6 

25 


U03781; 

2 








U03778 


lgG2 

ND f 

6.6 

ND 

49.6 



U03779; 

2 








U03780 


lgG3 

ND 

6.6 

ND 

50.2 



EU372658 

1 

lgG4 

ND 

6.6 

ND 

49.6 



U03782; 

2 








EU372654 


lgG5 

ND 

6.6 

ND 

49.2 



EU372657; 

2 








EU372656 


lgG6 

ND 

6.6 

ND 

49.2 



EU372655; 

2 








EU372653 


igE 

ND 

ND 

ND 

60.2 



U96100 


IgD 

-160 

ND 

56 


25 


AF411239; 



AF515674 

a Determined by equilibrium sedimentation by Setcavage and Kim (1976) and supported by less rigorous estimates from size 
exclusion chromatography. Estimates for slgA are based on chromatographic behaviour. Values are in kilodaltons (kDa). 
determined by SDS-PAGE and expressed as kDa. 

c Based on deduced amino acid sequence and using a mean HCDR3 length of 42 nucleotides. Differences among Cy 
subclass chain length are too small to be resolved by SDS-PAGE. The deduced mass of the kappa and lambda chains is 
24kDa, i.e. the same as determined by SDS-PAGE, which is consistent with the absence of glycosylation. 
d J-chain and secretory component. 

e lndividual subclass proteins have never been purified. The value given is only for partially purified IgGl, which is the 
major IgG transcribed. 
f ND = not determined. 
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Fig. 6.2. Organization of the porcine Ig heavy chain locus, (a) The 5' portion of the porcine heavy chain 
constant region locus (segment spanning the region from IGHJ5 to IGHG3) is depicted but not drawn to 
scale; some distances are indicated. Annotations: Sp and S5 = the IgM and IgD switch regions, respectively; 
Enh = heavy chain enhancer; Cp1-Cp4 and C51-C83 = domain exons of IgM and IgD, respectively; Cps, 
C8s, CpM and C5M = the secretion and transmemebrane exons of IgM and IgD, respectively. The exons for 
Cy3 and Sy3 are also shown, (b) The genomic map of the 3' portion of the porcine heavy chain variable 
region locus includes the 15 most 3' VH genes, which account for nearly the entire expressed VH repertoire. 
The map overlaps with that in Figure 6.2a; both share the position of IGHJ5. The nomenclature for the VH 
genes corresponds to that given and explained in Table 6.5. Modified from Eguchi-Ogawa etal. (2010). 


Unfortunately there are currently no fully veri¬ 
fied monoclonal antibodies (mAbs) that can 
identify these subclass IgGs, and therefore their 
concentrations in serum or their role in any 
particular immune response cannot be meas¬ 
ured. IgG is the ‘flagship Ig’ for all mammals; in 
nearly all eutherian mammals IgG has been 
diversified after speciation into subclasses, 
except among very closely related species like 
domestic sheep, cattle and goats (Kehoe and 
Capra, 1974; Nguyen, 2001; Butler, 2006; 
Butler et a/., 2009b). Among the artiodactyls 
of the family Bovidae, diversification appears 
to have begun in a common ancestor. It is 
important to appreciate that the names, e.g. 
IgGl and IgG2, among mammals do not imply 
homology. As in other mammals, major differ¬ 
ences among swine subclasses are located in 
the hinge region (Butler et al., 2009b). 
Evolutionary divergence of subclasses appears 


to have been through a combination of gene 
duplication and genomic gene conversion. 
The total number of IgG genes in swine may 
exceed the six subclasses that are expressed, 
as there is evidence for additional pseudogenes 
(T. Eguchi-Ogawa et al., unpublished). In add¬ 
ition there are some animals that have deletions of 
IGHG2 and IGHG6 (Butler et al., 2009b). This 
suggests that the IGHG portion of the heavy chain 
locus has been very flexible during evolution. 

The allelic variants of IGHG are best 
known for IGHG1 and IGHG2. There are out- 
bred swine that are homozygous for IgGl a and 
IgG2 a , especially among some Mieshan that 
are also homozygous for IgA a . The haplotype 
linkages for the allelic variants of the IgG 
subclasses have not been established but should 
be forthcoming with additional mapping stud¬ 
ies and from ongoing investigation of JH5± 
heterozygotes that have only one functional 
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haplotype. An interesting feature of IGHG3, 
which is phylogenetically the ancestral IGHG 
for swine (Butler et a/., 2009b), is its expression 
in fetal and newborn piglets (Butler and Wertz, 
2006). However, once the newborn encounters 
antigen, IgG3 expression drops precipitously 
from 50% to <1% (J.E. Butler et a/., unpublished). 
The map location of IGHG3 is the most IGHD 
proximal IGHG gene. As IgG3 appears in a 
variety of peripheral lymphoid tissues of the 
newborn as the major expressed IgG transcript, 
it may be spliced to the rearranged VDJ (a 
DNA rearrangement between a selected IGHV, 
a selected IGHD and a selected IGHJ gene) in 
the manner of IgD in mice, although a >3 kb 
switch sequence (Sy3) is present (Fig. 6.2a) 
(Eguchi-Ogawa et a/., 2010). 


Genomic map of the constant 
heavy-chain locus 

The genes for IgE (IGHE) and IgA ( IGHA ), in 
that order, are found at the 3' end of the heavy 
chain constant region locus, as is true for other 
mammals (T. Eguchi-Ogawa et ai, unpublished). 
IGHE is highly conserved and most homolo¬ 
gous to human and horse IGHE. Swine IGHA 
is also conserved and most homologous to 
human and dolphin IGHA. Swine IgA occurs 
in two allelic forms; IgA b has a mutated splice 
acceptor site, which results in a four amino acid 
deletion in the hinge (Brown et ai, 1995). The 
expressed gene frequency is much higher for 
IgA a , although, up to now, the hinge deficiency 
of IgA b does not seem to have any deleterious 
effect. Rather, the gene frequency for the two 
allotypes appear founder related (Navarro 
et a/., 2000). Both alleles appear to be equally 
susceptible to cleavage by proteases of 
Haemophilus parasuis (Mullens et ai, unpub¬ 
lished). As in other mammals, IgA is the major 
mucosal Ig for swine, including in mature milk 
(Porter, 1969; Klobasa and Butler, 1987). The 
IgA and IgM repertoires were more diverse in 
ileal than in jejunal piglet Peyer’s patches (PP), 
reflecting a more diversified microflora in the 
ileal PP (Levast et ai, 2010). IgA responses to 
natural mucosal infections are well docu¬ 
mented, such as with foot-and-mouth disease 
viruses, whereas killed vaccines preferentially 


stimulate IgG responses (Pacheco et ai, 2010). 
While verified mAbs are available for the study 
of IgM and IgA, those for IgE, IgD or the vari¬ 
ous subclasses of swine IgG are not. 

The variable heavy chain locus is com¬ 
posed of a yet undetermined number of IGHV 
genes, four diversity IGHD segments and five 
IGHJ segments (Eguchi-Ogawa et ai, 2010). 
Nearly 30 IGHV genes have been identified 
(Butler et ai , 2006a; CIgW, 2010). The 15 
3'-most IGHV genes are listed in Table 6.5 and 
their position is shown in Fig. 6.2b. All porcine 
IGHV genes belong to the ancestral VH3 fam¬ 
ily of clan III (Sun et ai ., 1994). The porcine 
IGHV genes were originally named according 
to their order of discovery and referred to in the 
vernacular as VHA, VHB , etc., because their 
position in the locus was unknown at the time 
of their discovery. Since the recent mapping of 
the locus (Fig. 6.2b) these genes can now be 
renamed according to the nomenclature used 
for other well-studied species. We provide both 
the familiar vernacular (VH1, VH2, etc.) and 
the international ImMunoGeneTics (IMGT) 
nomenclature (IMGT, 2010) for these genes in 
Table 6.5 (Eguchi-Ogawa et ai, 2010). Among 
these are almost all of the genes that comprise 
the expressed repertoire of fetal and young pigs 
(Sun et al ., 1998; Butler et a/., 2006a), as well 
as a number of pseudogenes. 

All porcine IGHV genes have identical 
recombination signal sequences (RSS), which 
simplifies studies on B cell lymphogenesis 
(see below). Various IGHV genes appear as 
hybrids by sharing complementarity determin¬ 
ing region 1 (CDR1) and CDR2 sequences, 
whereas their framework (FR) regions are virtually 
identical (Sun et al ., 1994; Butler et al., 
2006a). Therefore, it was postulated that they 
evolved by a combination of gene duplication 
and genomic gene conversion similar to the 
IgG subclasses of swine, and by the mecha¬ 
nism described by others (Meselson and 
Radding 1975; Szostak et al., 1983; Butler 
et al., 2009b). There is also a major segment 
duplication within the 3' region of the locus 
that involves the unit containing VHA*, VHF 
and VHB*, with the unit containing VHA, 
VHT, VHB and VHE (Fig. 6.2b). Repetitive 
sequences around these genes suggest that 
the duplication has been recent (Eguchi-Ogawa 
et al., 2010). 
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Table 6.5. Nomenclature for swine Ig DH, JH and 3' VH Genes. 3 


IMGT nomenclature 

Vernacular 

nomenclature 

Number of 

exons 

CDS (coding 
sequence) length 
(bp) 

Comment 

IGHV 15 

VHN 

2 

353 


IGHV14 

VHY 

2 

353 


!GHV13pP 

P 

1 

213 

Exon 1 and part of exon 





2 were collapsed 

IGHV12 

VHB* 

2 

359 


IGHV11 

VHF 

2 

359 


IGHV10 

VHA* 

2 

353 


IGHV9p 


1 

213 

Exon 1 and part of exon 





2 were collapsed 

IGHV8 

VHE 

2 

359 


IGHV7p 


1 

213 

Exon 1 and part of exon 





2 were collapsed 

IGHV6 

VHB 

2 

359 


IGHV5 

VHT 

2 

359 


IGHV4 

VHA 

2 

353 


IGHV3p 


1 

213 

Exon 1 and part of exon 





2 were collapsed 

IGHV2 

VHG 

2 

359 


IGHVIp 

P 

2 

352 

Stop codon in exon 2 

IGHD1 

DHA 

1 

38 


IGHD2 

DHB 

1 

28 


IGHD3 


1 

37 


IGHD4 


1 

11 


IGHJ1 


1 

54 


IGHJ2 


1 

53 


IGHJ3 


1 

48 


IGHJ4 


1 

51 


IGHJ5 

JH5 

1 

54 



a VH gene names were classified according to complementarity determining region 1 (CDR1) and CDR2 sequence 
specificities (Butler et al., 2006b). The vernacular terminology is based on order of discovery, whereas the IMGT 
(ImMunoGeneTics) system is based on the position in the genome as shown in Fig. 6.2b. 
b p= pseudogenes discovered in genomic DNA before mapping studies (see Butler et al., 2006b). 


For the last 15 years, only two IGHD and 
one IGHJ gene segments were known to be 
expressed (Butler et a/., 1996; Sun and Butler, 
1996). The recent mapping studies (Fig. 6.2) 
indicated that only two of the four IGHD seg¬ 
ments are functional and only the last IGHJ seg¬ 
ment (JH5) is functional (Eguchi-Ogawa et al., 
2010). In most cases, those segments not used 
have non-canonical RSS. Further evidence that 
swine have only one functional JH segment was 
demonstrated by disrupting IGHJ5, which 
results in an IGHJ5-/- piglet that completely 
lacks B cells (Mendicino et al., 2010). Swine are 
currently the only species in which disruption of 
a single IGHV gene segment eliminates the 
entire humoral immune system and, should it 


occur naturally, would result in the death of all 
affected pigs as well as heterozygous JH5-/+ 
piglets (Mendicino et al., 2010). 

Further studies on the germ line IGHV rep¬ 
ertoire will require mapping the region upstream 
from that shown in Fig. 6.2. This may also 
require the use of bacterial artificial chromo¬ 
some (BAC) libraries prepared from swine of 
different breeds to account for the VH genes not 
accounted for by the current map (Fig. 6.2; Table 
6.4). It is conceivable that a number of the IGHV 
genes that have been reported (Butler et al., 
2006b) are alleles of those listed in Table 6.4. 
The NIH minipigs, which were selected for their 
SLA haplotypes, are homozygous for IGHA 
(Navarro et al., 2000), and usually for IgGl a 
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(J.E. Butler and N. Wertz, unpublished); they 
could be valuable for such extended studies. 


Genomic organization of the Ig kappa 
and lambda light-chain loci 

Like other mammals, swine possess two light- 
chain loci. The Ig lambda light chain (IGL) 
locus maps to SSC14ql7-q21 whereas the Ig 
kappa light-chain ( IGK) locus maps to 3ql2- 
ql4 (Rettenberger et a/., 1996; Chaudhary 
et a/., 1997). The IGK locus in swine is organ¬ 
ized in the same translocon manner as kappa 
in other mammals and as the Ig heavy-chain 
locus (Butler et a/., 2004). The kappa locus 
contains a single IGKC gene and five IGKJ 
gene segments, which align very well to the 
IGKJ segments of the human, sheep, rabbit, 
rat, horse and mouse. These are located 
3 kb upstream from IGKC and are separated 
by -300 nucleotides from each other. IGHJ1 
has a canonical RSS but this is not true for the 
heptamers of IGKJ2-IGKJ5. Nevertheless, 
Jk 2 accounts for 90% of all IGKJ usage in 
newborn piglets. Thus, the exchange of an A 
for C in the heptamer does not seem to matter. 
While not extensively studied in animals of dif¬ 
ferent age and breed, it has not been confirmed 
whether IGKJ3-IGKJ5 are used and that their 
less canonical heptamers occur in all swine. 

The IGKV genes of swine belong to at least 
two families: IGKV1 (VkI) and IGKV2 (Vk2). 
There are >60 IGKV2 genes belonging to at 
least six subfamilies. The number of IGKV1 
genes has not been estimated but they appear 
to belong to at least two to three subfamilies. Both 
IGKV1 and IGKV2 genes share 87% sequence 
similarity with the same families in humans 
(Butler et a/., 2004). Kappa light chains in 
young and adult pigs appear to account for 
-60% of total light chain usage. This is very 
similar to that in humans. However, at sites of 
early B cell development there appears to be 
preference for lambda usage, which probably 
represents expression of X5, which is similar to 
the pre-BCR in mice (Butler et aI., 2006b). 

The IGL is less well characterized than 
kappa. There has never been a focused study 
of this locus. What is known is that IGL is 
organized as in other mammals with three to 


four tandem-arrayed IGLJ-IGLC cassettes that 
are located downstream from an unknown 
number of VX genes. These belong to at least 
two families, IGLV3 and IGVL8 (Butler et a/., 
2006a; K. Wells and J.E. Butler, unpublished). 


B cell lymphogenesis and development 
of the porcine VH repertoire 

B cell lymphogenesis is initiated when lympho¬ 
cyte stem cells begin to rearrange their Ig 
genes. This starts with IGHD to IGHJ rear¬ 
rangements and may occur simultaneously on 
both alleles, as it does in well-studied species, 
and is followed by rearrangement of IGHD- 
IGHJ to an IGHV gene. The process generates 
signal joint circles of intervening DNA that 
includes the RSS. These can be detected by 
PCR (Fig. 6.3a). Evidence for heavy chain 
rearrangement is also obtained by PCR, using 
a primer set spanning a region from FR1 of a 
IGHV gene to the 3' end of JH, which includes 
FR4. This part of the process starts in the fetal 
yolk sac at 20 days of gestation (20 DG). At 
30 DG the process becomes evident in the 
fetal liver, and the primary transcript of the 
rearranged VDJs are then spliced to the C(i 
chain and expressed on the surface, together 
with a surrogate light chain comprising a ^5 
light chain and VpreB to form the pre-BCR. 
Porcine VpreB is highly homologous to that in 
mice and humans (Butler et a/., 2009a). 
However, there are currently no molecular or 
antibody reagents that can distinguish the 
swine IGL gene products, e.g. IGL5 from 
other IGLC sequences. Surface IgM, i.e. the 
initial BCR, has not been detected in fetal liver, 
so the pre-BCR is probably expressed on only 
very few cells. In mice, the surrogate light chain 
is replaced by an authentic Ck light chain. This 
remains unknown for swine because of the 
early predominance of C^ expression in fetal 
piglets (Butler et a/., 2005c). As mentioned 
above, this early X expression could represent 
X5, which is part of the pre-BCR, or could indi¬ 
cate that swine rearrange lambda before kappa. 
Later in development, the ratio of CA,:Ck is 
about 1, similar to that in humans (Butler et a/., 
2005b). The first expression of IgM on the 
B cell surface is at 45 DG in fetal liver (Sinkora 
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(a) (b) 



(c) 


VH FR3 

ARG 

TGTGCAATA 

TGTGCAAAA 

TGTGCAAGA 

TGTGCACAG 

TGTGCAAGA 

TGTCAATTG 

TGTGCAAGG 

TGTGCAAAA 

TGTGCAAAA 

TGTGCAACA 


CDR3 


GCGGTACGCTCAT 

AAC GGTTGCTATAGCGGTTACTGTGG 

GGCCGG GCGGTTGCTATAGCGGTTAC GGGAAGATATGAGAGGGGACATTTAGGCAAACCA 

ATAGCGGTT GCTATAGCGGTTAC CGGTCGATGGG 

GCCCTAGAT TATAGCGGTTGCTATAGC ATCAAGACGGGG 

GGTTGCTATAGCGGTTAC GAGTG 

TGGGGGTTCCCC 

ATAGCTATGGTGCTAGTTGCTAT TGAA 

CGGGGCTCTCATGGTGGCAGGATGAGATCA 

GGCGTAGATC 


ATTACTATGC TATGGATCTC 
TTACTATGCTAT GG AT CT C 
TACTATGCTATGG AT CT C 
TGGATCTC 
ATGGATCTC 
TCTC 

TACTATGCTAT GG AT CT C 
TATGCTATGGATCTC 
ATGGATCTC 
ATTACTATGCTATGG AT CT C 


v 



5'J h 5 


FR4 

IBP 

TGGGGCCCAGGCGTTGAAGTCGTCGTGTCCTCA 
TGGGGCCCAGGCGTTGAAGTCGTCGTGTCCTCA 
TGGGGCCCAGGCGTTGAAGTCGTCGTGTCCTCA 
TGGGGCCCAGGCGTTGAAGTCGTCGTGTCCTCA 
TGGGGCCCAGGCGTTGAAGTCGTCGTGTCCTCA 
TGGGGCCCAGGCGTTGAAGTCGTCGTGTCCTCA 
TGGGGCCCAGGCGTTGAAGTCGTCGTGTCCTCA 
TGGGGCCCAGGCGTTGAAGTCGTCGTGTCCTCA 
TGGGGCCCAGGCGTTGAAGTCGTCGTGTCCTCA 
TGGGGCCCAGGCGTTGAAGTCGTCGTGTCCTCA 
v y 

3'J h 5 


Fig. 6.3. Swine Ig VDJ rearrangement and repertoire diversity in CDR3. (a) Recovery of VH-DH and 
DH-JH signal joint circles from bone marrow via RAG (recombination-activating gene)-mediated 
recombination of VDJ gene segments, which involves the excision of the intervening sequences between 
exons. Evidence that such rearrangements occur is based on recovery of the intervening sequences in 
the form of signal joint circles. As swine have only two functional DH segments and one functional JH 
segment (JH5), the process of B cell lymphogenesis is much easier to follow than in mice or humans. The 
length of the four different signal joint circles that can be generated during rearrangements in the swine 
heavy chain locus are indicated, (b) CDR3 length analysis (spectratyping) of CDR3s expressed with IgM, 
lgG3 and IgA in the ileal Peyer’s patches of a newborn piglet. Noteworthy is the Gaussian distribution of 
length associated with IgM, which characterizes an unselected repertoire, compared with a highly 
selected lgG3 repertoire and a somewhat selected repertoire for IgA. (c) CDR3 sequences in VDJ 
rearrangements expressed with VHA (IGHV4; Table 6.4). The 3' region of FR3 of VHA and FR4 encoded 
by JH5 are indicated. Identifiable portions of DHA and DHB are underlined. Nucleotides between JH and 
FR3 that are not underlined represent terminal deoxynucleotidyl transferase (Tdt)-dependent additions. 


et al., 1998; Sinkora and Butler, 2009). 
Because the IGL locus genes have not been 
characterized, exactly which IGLV-IGLJ-IGLC 
or IGLV-V preB rearrangements are first used 
remains unknown. However, when kappa light 
chains are later expressed, IGKV2 family 
genes rearranged to IGKJ2 predominate 
(Butler et a/., 2004). 

Much more is known about IGHV usage 
during fetal and neonatal life, and how usage 
leads to the appearance of the heavy-chain 
pre-immune repertoire, than is known about 
the light-chain repertoire. It has been recog¬ 
nized for 15 years that swine use very few VH 


genes (Sun and Butler, 1996), and this pattern 
persists throughout most of fetal life (Sun et a I., 
1998). The major genes that are used are 
VHA, VHB, VHE, VHP, VHC and VHA* 
(Table 6.5). With the exception of VHC , all of 
these are located among the VH genes at the 
3' end of the locus, although VHG , which is 
the terminal functional VH gene, is seldom 
used, while IGHV15 (VHN) is used in 20 DG 
yolk sac (Fig. 6.2b; Eguchi-Ogawa et al., 
2010). Interestingly, IGHV15 (VHN) is 
expressed very early, but usage then declines in 
late fetal life. By monitoring the relative usage 
of the major IGHV genes described above, it is 
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possible to construct a repertoire diversifica¬ 
tion index (RDI), the numerical value indicating 
the degree of diversity of the BCR. We have 
shown that there is a tendency for greater 
diversity at 20 DG, and this becomes con¬ 
tracted at 30 DG and remains so during the 
remainder of fetal life. However, the RDI 
increases after birth in antigen-sensitized ani¬ 
mals (Butler et a/., 2006b; J.E. Butler et a/., 
unpublished). Much of this increase is the result 
of SHM. We have speculated that once the 
pre-BCR or BCR appears, e.g. at 30 DG, there 
is positive B cell selection, perhaps by stromal 
ligands in a manner similar to what results in 
the proliferation of pre-B cells when they 
express a pre-BCR (Melchers, 1995). 

As swine use very few IGHV genes and 
have only two functional IGHD genes and one 
functional IGHJ (IGHJ 5), developing a reper¬ 
toire comprising 10 10 different B cells (Cohn, 
2006) is dependent on the structure of CDR3 
as well as on SHM. Because of the restricted 
VH repertoire of swine, we estimated that 
>95% of the swine repertoire is determined by 
CDR3 diversity (Butler et a/., 2000). CDR3 is 
the sequence resulting from the junction of 
V-D-J (Fig. 6.2b) during rearrangement, and 
extends from the terminal arginine codon of 
FR3 of the VH gene to the invariant tryp¬ 
tophan codon of FR4 in the 3' portion of the 
JH segment (Fig. 6.3c). Diversity within CDR3 
mostly results from exonuclease activity, which 
can shorten especially IGHD and IGHJ , and 
by insertion of nucleotides between IGHV , 
IGHD and IGHJ segments through the action 
of terminal deoxynucleotidyl transferase. The 
CDR3 region is also the target of SHM, but the 
extent of SHM is difficult to quantify compared 
with that for IGHV genes themselves (Fig. 6.3). 
Figure 6.3b illustrates that CDR3 regions differ 
in length as well as sequence (Fig. 6.3c), even 
when rearranged to the same IGHV gene. The 
origin of the sequence, i.e. from IGHD , IGHJ 
or inserted nucleotides, is indicated in Fig. 6.3c. 
Among the thousands of VDJs from swine that 
we have sequenced, we have never found the 
same sequence unless we accidentally recov¬ 
ered two VDJs from the same B cell. In other 
words, CDR3 is clone specific. In addition to 
differences in sequence, most CDR3s are of 
different length, so using a length analysis 
method called spectratyping it is possible to 


monitor the clonality of a particular lymphoid 
organ. Figure 6.3b shows that the repertoire of 
IgM appears unselected because the spectra- 
type pattern is Gaussian, whereas the spectra- 
type for IgG3 is especially selected, and that 
for IgA is selected to a lesser extent. Thus, 
spectratyping is a convenient way to study the 
genetics of the B cell repertoire. The impor¬ 
tance of CDR3 to antibody specificity was 
demonstrated by Xu and Davis (2000), who 
showed that transgenic mice with only one 
IGVH gene, but with the IGHD and IGHJ 
region of the locus intact, could make antibod¬ 
ies to nearly all antigens. 


Modification of the BCR through somatic 
hypermutation (SHM) in swine 

SHM depends on the expression of activation- 
induced cytidine deaminase (AID), a member of 
the AFOBEC family of cytosine deaminases 
(Conticello et a/., 2005; Chiu and Greene, 
2008). This enzyme is primarily expressed in 
germinal centres following antigen stimulation 
(Muramatzu et a/., 1999). However, there is 
very little germinal centre formation during fetal 
life in any mammal. In fetal piglets, the SHM 
frequency is <20 mutations per kilobase (Table 
6.6). This frequency is constant across the vari¬ 
ous segments of the IGHV genes. This sharply 
differs from that in piglets exposed to environ¬ 
mental antigen in which mutations accumulate 
in CDR1 and CDR2. A similar pattern is known 
for mice (Berek and Milstein, 1987). We know 
that part of the cause for the rise in repertoire 
diversification in antigenized piglets is the result 
of SHM (Butler et a/., 2006b). 


The Genetics of the Porcine T Cell 
Receptor Repertoire of Swine 

Organization of the TCR loci in swine 

Like other mammals, pigs have TCRs that con¬ 
sist of a/p and y/5 heterodimers, TCR a/p and 
TCR y/5, respectively. The TCR 5 chain locus 
(TRD) is embedded in the TCR a chain locus 
(TRA) and located on the middle of the q-arm 
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Table 6.6. Mutation frequency (mutations/kilobase) of transcribed V H 3 genes of piglets of different age 
and treatments. 3 


Source 

Number 

FR1 

CDR1 

FR2 

CDR2 

FR3 

Total 

Newborn 

7 

16.8 ± 10 

28 ± 12 

0 (NA) 

35 ±28 

20 ± 10 

17.9 ±7 

piglet 

Colonized 

10 

8.2 ±11 

140 ± 56 a 

2.3 ±2 

104 ± 60 b 

31 ±24 

34.8 ± 13 b 

isolator 

Conventional 

15 

14.9 ±7 

195 ± 66 b 

12.6 ±7 

CO 

i+ 

cn 

IT 

79 ± 56 d 

68.6 ± 25 b 


5 week 0 

a Mean values for each region of the V H gene and for the total among different groups were compared by Student’s t tests. 
b Significantly higher than in newborn piglets (P< 0.007-0.0001). 
c Reared conventionally and infected with helminth parasites. 
d Significantly higher than in newborn and colonized piglets (P< 0.008). 


of chromosome 7 (7ql5.3^q21) (Hiraiwa 
et a/., 2001; Uenishi et a/., 2003). The TCR p 
(TRB) and y (TRG) loci are mapped on the 
q-arms of chromosome 18 (18qll.3—>ql2) 
and 9 (9q21—>q22), respectively (Eguchi-Ogawa 
et a/., 2009). The TRA and TRG chain genes 
use variable (TRV) and joining ( TRJ) segments 
for genomic recombination to generate func¬ 
tional genes, whereas TRB and TRD use TRV, 
diversity (TRD) and TRJ segments for recombi¬ 
nation. Among pig TR loci, detailed genomic 
sequences from constant regions to the 
3'-region carrying V segments are available in 
TRA/TRD and TRB. In pig TRA, 62 TRAJ 
segments were observed, and the sequence of 
most of the segments was highly conserved to 
their human and mouse genome counterparts 
(Uenishi et al., 2003). Humans and pigs com¬ 
monly have four TRDJ segments on their 
genomes. Humans and mice have four and 
three TRDD segments, respectively; however, 
in pig TRD, seven TRDD segments were 
observed, although one of the TRDD segments 
is considered to be non-functional. This sug¬ 
gests that the pig TRD locus has an ability to 
generate broader diversity of the rearranged 
TCR gene than the human and mouse loci, 
particularly in CDR3. Comparison of the 
genomic region carrying TRDD segments 
between humans and pigs demonstrates a four¬ 
fold duplication in the pig genome, suggesting 
there may be even more functional TRDD seg¬ 
ments located in this region. 

As for the genomic region carrying TRAV/ 
TRDV segments, many units of duplications 
are observed in pigs compared with humans. 


Particularly, genomic segments carrying 
TRDV1 and several TRAV segments showed 
at least fourfold duplication in the pig genomic 
sequence, which to date covers only the 3' 
part of the entire TRAV-TRDV region (Fig. 
6.4) (Uenishi et al., 2009). For TRB loci, the 
3'-proximal regions carrying TRBV and the 
entire region containing TRBD-TRBJ-TRDC 
units were completely sequenced. The compo¬ 
sition of individual sets of TRBD-TRBJ-TRBC 
does not have remarkable differences between 
humans, mice and pigs. However, three TRBD- 
TRBJ-TRBC sets were observed in the pig 
genome, whereas the human and mouse 
genomes have only two (Fig. 6.5); all of the 
three units in pigs are considered to be func¬ 
tional by observation of locus-specific expres¬ 
sion (Eguchi-Ogawa et al., 2009). 


The expressed TCRVp ( TRBV) 
repertoire of swine 

Nineteen families of porcine TRBV genes fit¬ 
ting into seven supergroups have been identi¬ 
fied (Butler et al., 2005b). Based on >70% 
sequence similarity, we identified 17 that are 
sequence homologues of recognized human 
TRBV families (Fig. 6.6). Many of the same 
TRBV genes were also reported by Baron 
et al. (2001). The porcine gene families were 
given the same names as their human counter¬ 
parts, e.g. p TRBV 12 corresponds to 
h TRBV12. One family ( pTRBVX) has no 
human homologue. TRBV families that belong 
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Fig. 6.4. Dot-plot analysis for comparison between porcine and human germ line sequences encoding 
theTCR a/6 chain genes (TRA/TRD). The pig genomic sequence carrying TRA/TRD segments (top, 
horizontal) were reconstructed from the registered data in DDBL/EMBL/GenBank (AB457789, 

AB182374, AB182373, AB182372, AB182371 and AB053451) and subjected to masking of genomic 
interspersed sequences. The human counterpart sequence (left, vertical) was also reconstructed from 
the publicized data (AE000660-AE000662). Locations of human TCR a/6 gene segments are annotated 
according to IMGT (lmMunoGeneTics)/LIGM-DB (Ehrenmann et al., 2010). The TRDV1 segments in the 
human and porcine genomes are indicated by closed triangles (on the vertical and horizontal sequences) 
and broken lines. The repeated units carrying TRDV1 and TRDD segments are depicted by dark (a) and 
light grey (b) rectangles, respectively. 
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Fig. 6.5. Genomic structure of the porcine TRBD-J-C gene clusters. CDSs (coding sequences) and 
repetitive sequences are indicated. Intron lengths of the TRBC segments are indicated in parentheses. 
Characteristic repetitive sequences detected in the region are also indicated. Recombination signals of 
the TRBJ3-A segments started within TRBJ3-2, suggesting that TRBJ3-A was a pseudogene. 


35% 

57% 

73% 

89% 




V19_B 

gDP-46 

V14_B 

gT204 

V12_B 

gS44 

5 

CO 

5* 8 

CO 

M102 

gDP-202 

4-1501 

T101 

T144 

M101 

T134 

4-1406 

M406 

gDP-230 

gM225 

T1502 

V21_B 

8-1503 

M507 

M505 

T501 

^ ^ <2i <£. 

oi cn a s 

oo r\> o> _ 

V4_B 

gDP-2 

CO < 

C D rv> 
-*• o 

CD 

V100_B 

gT203 


V 

V 

V 

1 

V 

V 


V 

V 

V 

V 

V 

V 

V 

V 

V 

VV 

V 

IMGT 

19 

27 

10 

6 

25 

15 : 

3 

4 

7 

12 

2 

11 

9 

5 

21 

20 

29 

30 

X 

BUTLER 

(1) 

(1) 

(1) 

(1) 

(4) 

(6) 1 

:i) (64) 

(82) 

(71) 

(4) 

(1) 

(13) 

(45) 

(20) 

(11) 

(1) 

(1) 

(1) 

BARON 

17 

14 

12 


11 

24 

_ _ 

7 

6 

8 

22 

21 

1 

5 

10 

2 

4 

20 

100 


Fig. 6.6. The expressed porcine TRBV genes belong to 19 families. The porcine TRBV families were 
renamed using the human nomenclature in the IMGT (ImMunoGeneTics) database. The nomenclature 
used by Baron et al. (2001) is also given. The numbers in parentheses are the number of clones of each 
family that were recovered in the analysis by Butler et al. (2005b). 


to the same supergroup share the same leader 
sequence: pTRBV4 , 5, 7 and 12 were most 
frequently recovered. TRBV genes of seven 
families were only recovered from peripheral T 


cells and not from thymocytes. Only pTRBV2-9 
and -15 were recovered in large enough num¬ 
bers to render this observation statistically sig¬ 
nificant. No pTRBV family was exclusively 
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recovered from thymocytes. Most frequently 
used families are multigenic, but pTRBV5 and 
pTRBV12 may be represented by a single gene 
(J.E. Butler and N. Wertz, unpublished data). 

The expression of porcine TCRV8 ( TRDV) 

gene families 

The expression of the five TRDV families 
(TRDV1-TRDV5) has been studied in swine of 
various ages (Holtmeier et al., 2002, 2004). 
Clonal diversity was measured by a combina¬ 
tion of spectratyping (see example of method in 
Fig. 6.3b) followed by recovery and sequence 
analysis of CDR3 regions. Organ-specific 
spectratypic patterns suggested compartmen- 
talization of T cells expressing different TRDV 
gene families in the spleen, lung, intestine and 
colon. As expected from their limited antigen 
exposure, the ileal PF and caecum were more 
oligoclonal in germ-free piglets than in specific 
pathogen-free piglets (Holtmeier et al., 2002). 
The TRDV repertoire also showed increasing 
clonal restriction with age and development, 
especially in the intestine. Nevertheless, 
sequence analysis of recovered CDR3 sequences 
revealed that some of the same T cell clones 
were widespread (Holtmeier et al., 2002). The 
compartmentalization in adult swine is diagram- 
matically shown in Fig. 6.7, and the overall 
pattern is reviewed in greater detail elsewhere 
(Butler et al., 2006a). 


One feature of the TCR that has hereto¬ 
fore been less well realized than in studies on 
BCR expression is the expansion of certain T 
cell clones bearing so-called invariant receptors. 
This has also been shown for the V53-J53 
receptor in swine (Holtmeier et al., 2004). An 
invariant TRDV TCR has also been shown in 
mice (Havran et al., 1991). Expression of 
clones bearing this receptor in swine dominates 
the TRDV3 repertoire of all organs at mid¬ 
gestation. At the end of gestation, clones bear¬ 
ing this receptor were absent in the thymus but 
were still present in intestine and spleen. This, 
as reviewed by Butler et al. (2006a), may indi¬ 
cate a role for the y/5 T cells in innate immu¬ 
nity, just like the dominant BCR expression in 
fetal piglets discussed above. Both phenomena 
may suggest positive selection of both T and B 
cells during development, which would be con¬ 
sistent with the presence of a grey area between 
innate and adaptive immunity (Flajnik and Du 
Pasquier, 2004). A role for y/5 T cells in innate 
immunity has also been suggested by others 
(Takamatsu et al., 2006). 

Quantitative Trait Loci (QTLs) for 
Immune Response Traits 

Numerous studies have been aimed at map¬ 
ping QTLs for traits related to immune 
responses (Table 6.7) (Edfors-Lilja et al., 
1998, 2000; Wattrang et al., 2005; Reiner 
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Fig. 6.7. Compartmentalization of the y8TCR repertoire in adult swine. The various patterns indicate 
shared (same pattern) or unshared (different pattern) repertoires. The vast majority of y8T cells are 
compartmentalized. A highly polyclonal repertoire was always present in the spleen. A minority of clones 
are shared between the intestine and lung. Adapted from Holtmeier et al. (2002) and Butler et al. (2006a). 




























Table 6.7. Quantitative trait loci (QTLs) associated with pig health traits. 


Trait names 

Abbreviation 

QTL map location as SSC:cM map position (range) 

Reference(s) 

Blood parameters 

Haptoglobin concentration 

HAPT 

3: (18-60); 5: (80-119); 6: (88-92); 8: 31; 9: (73-100); 122; 10: 

191; 11 : 58; 12: 31; 37; 14: (7-60); 105; 17: 9; (48-94) 

Wimmers etal., 2009 

Haematocrit 

HCT 

1 : (52-81); 76; 2: (27-42); 3: (17-42); 4: 61; 5: 72; 6: 63; 7: 
(58-90); 63; 66; 8: (52-62); (74-83); (80-107); 10: (67-101); 
69; (101-128); 13: 33; (35-70); (70-102); 18: 32 

Wattrang etal., 2005; Reiner etal., 
2007a; Zou etal., 2008 

Platelet count 

PLTCT 

2: 65; 3: 42; 53; 5: 31; 40; 6: 91; 7: 30; 13: 76; 18: 37 

Reiner etal., 2007b;Yang etal., 2009 

Red blood cell count 

RBC 

2: (27-42); 5: 72; 7: (58-90); 77; 90; 8: 61; 63; 64; (80-107); 13: 
(35-70); (70-102); 15: (34-81); 18: (31-45) 

Reiner etal., 2007a; Zou etal., 2008 

Disease resistance 

ADV antibody titre 

ADVAB 

2: 87; 6: 84 

Wimmers etal., 2009 

Anti-K88 Escherichia coli Ig 

K88Ab 

5:64 

Edfors-Lilja etal., 1998 

Anti-0149 E. coli IgG 

0149Ab 

6:69 

Edfors-Lilja etal., 1998 

Melanoma susceptibility 

MELAN 

1:49; 78; 85; 88; 2: 10; 17; 18; 23; 26; 3: 42; 4: 4; 84; 6: 5; 17; 

108; 140; 7: 4; 49; 73; 8: 22; 58; 60; 56; 9: 42; 67; 10: 42; 54; 

12: 96; 13: 56; 71; 81; 14: 34; 46; 98; 15: 86; 16: 45; 17: 

36; 40; 45; 53; 18: 19 

Du etal., 2007; Gomez-Raya etal., 2007 

PRRSV antibody titre 

PRRSVAB 

1:56; 7: (81-102) 

Wimmers etal., 2009 

Resistance/susceptibility to 

PrV 

5: 31; 6: 41; 9: 139 

Reiner et a!., 2002 

pseudorabies 

Sarcocystis miescheriana IgG 

SMIGG 

4: 39; 5: 104; 126; 7: 156; 8: 89; 12: 33; 17: 83; X: 58 

Reiner etal., 2007b 

S. miescheriana IgM 

SMIGM 

5: 58; 12: 96; 13:19 

Reiner etal., 2007b 

E. coli FI 8 receptor 

ECF18R 

6:76 

Meijerink etal., 1997 

Immune capacity 

Band-formed neutrophil no. 

BFNEUT 

1 : (79-103) 

Wattrang etal., 2005 

Basophil no. 

BASO 

8: 126; 13: (35-70) 

Reiner et al., 2008 

Complement (C') C'3c 
concentration 

C3C 

1 : 139; 2: 23; 25; 29; 101; 3: (60-72); 4: (79-120); 5: 40; 

55; 224; 6: (88-93); 7: 60; (102-135); 8: 164; 10: 191; 

12: 147; 14: 24; 43; 91; 15: 73; 16: (0-47); 36; (47-93); 

75; 79; 18: 36 

Phatsara etal., 2007; Wimmers etal., 
2009 

CD2+ leucocyte no. 

CD2L 

1 : (43-89) 

Wattrang et al., 2005 

CD4+ leucocyte no. 

CD4L 

1: (43-89); 8: (74-83) 

Wattrang etal., 2005 
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CD8+ leucocyte no. CD8L 

conA-induced cell proliferation conA 


1: (79.4-102.9) 
7: 73; 8: (74-83) 


Wattrang etal., 2005 
Edfors-Lilja etal., 1998; Wattrang etal., 
2005 


Eosinophil no. 

EOS 

1: 123; 1 1 : 69; 1 2: 7; 1 7: (0-32); 22; 26; 41; X: (0-35); 31; 125 

Reiner et al., 2007b; 2008 

Haemolytic C' activity 

AH50 

2: 17; 20; 28; 3: 117; 4: (79-120); 7: 47; 1 1: 24; 30; 1 7: 69 

Wimmers et al., 2009 

(alternative pathway) 

Haemolytic C' activity 

CH50 

2: 92; 4: (79-120); 6: 81; (93-102); 8: (52-62) 

Wimmers et al., 2009 

(classical pathway) 

lgM+ leucocyte no. 

IGML 

1:(53-81) 

Wattrang etal., 2005 

Lymphocyte no. 

LYMPH 

3: 18; 7: (30-58); 64; 8: (38-58); (52-62); -1.3; 11: (0-33); 12: 31; 
95.8; 14: 51; 76; 15: 92; 17: (63-94); 18: (0-32) 

Wattrang et al., 2005; Reiner et al., 2008; 
Yang et al., 2009 

MHCII+ leucocyte no. 

MHCIIL 

8: (74-82.8) 

Wattrang etal., 2005 

Monocyte no. 

MONO 

2: 64; 8: (107-126); 11: (0-33); 12: (33-65) 

Reiner et al., 2008 

N1c+ leucocyte no. 

NIcL 

8: (52-62) 

Wattrang etal., 2005 

PHA proliferation 

PHA 

1 : (79-89) 

Wattrang etal., 2005 

Post-stress mitogen induced 

IL2ACT 

6: 109; 12: 102 

Edfors-Lilja et al., 2000 

IL-2 activity 

PWM-induced cell 
proliferation 

PWM 

4: 75; 8: (74-83) 

Edfors-Lilja etal., 1998; Wattrang etal., 
2005 

Segmented neutrophil no. 

SNEUT 

1: 79; (81-94); 2: (0-27); 4: 8; 7: 67; 8: (38-58); -1.3; 11: (0-33); 
12: 96; 16: (9-24); 17: (0-32) 

Wattrang et al., 2005; Reiner et al., 2008; 
Yang etal., 2009 

Spontaneous cell proliferation 

SPONT 

13: 107 

Edfors-Lilja et al., 1998 

Stress-induced alteration in 

IMM 

8: 60 

Edfors-Lilja et al., 2000 

no. of neutrophils 

Stress-induced leucocyte 

LEUKPRO 

2: 78 

Edfors-Lilja etal., 2000 

proliferation 

White blood cell counts 

WBC 

1: 78; (79-94); (94-123); 2: 64; 7: 65; 8: (-1-38); (80-107); 10: 
(20-39); 12: 38; (80-96); 15: 83 

Edfors-Lilja etal., 1998; Wattrang etal., 
2005; Reiner et al., 2008; Yang et al., 
2009 

Pathogen 

Parasite load 

PARAS 

2: 64.3; 7: 60.7; 16: 24.3cM 

Reiner et al., 2007b 
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et al., 2007a,b, 2008, 2010; Zou et al., 
2008; Wimmers et al., 2009; Yang et al. 
2009). QTLs are generally identified by com¬ 
paring the linkage (degree of covariation) of 
polymorphic molecular markers and pheno¬ 
typic trait measurements. The Fig QTL data¬ 
base, PigQTLdb (2010a), is a relational 
database that provides a comprehensive 
tool for QTL repository, comparisons and 
dynamic linking to comparative structural 
genome information. Selected QTLs are 
hypothesized to occur in specific chromo¬ 
somal regions containing genes that make a 
significant contribution to the expression of a 
complex trait. A search of the PigQTLdb for 
health and immune traits (PigQTLdb, 2010b) 
produced numerous QTL map locations for 
blood cell subset frequencies, serum protein 
and antibody responses (Table 6.7). Studies to 
affirm these map locations with pig health 
and resistance/susceptibility to pathogen 
infections have only just begun, as illustrated 


by the work of Reiner et al. (2002, 2007b). 
The next decade will clearly lead to major dis¬ 
coveries, as illustrated by the detailed analyses 
of melanoma resistance by Geffrotin et al. 
(2004), Du et al. (2007) and Gomez-Raya 
et al. (2007). 


Summary 

The genetic diversity of the loci determining 
the immune responses of pigs is only begin¬ 
ning to be explored. The availability of the 
annotated swine genome sequence is expected 
to stimulate many more advances into under¬ 
standing how immune genes affect pig health 
in all phases of production. Future studies will 
reveal the impact of allele variations on swine 
health and pathogen resistance/susceptibility. 
The stage is now set for fundamental studies to 
explore these areas. 
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Introduction 

Since the early days of animal cytogenetics, 
the pig (Sus scrofa, SSC) has been one of the 
most widely studied farm animals. This interest 
has been promoted by the relatively fewer 
chromosomes in the domestic pig (2n = 38) 
compared with other domestic species, such as 
dogs (2n = 78), alpacas (2n = 74), cattle (2n = 
60), goats (2 n = 60), sheep (2n = 54), buffa¬ 
loes (2n = 50), horses (2n = 64) and chickens 
(2n = 78). Furthermore, pig chromosomes are 
readily distinguishable from each other, and it 
is relatively easy to identify most of the chro¬ 
mosomes even without the use of special stain¬ 
ing methods. Porcine chromosomes have 
largely been studied for the analysis of karyo¬ 
typic anomalies in somatic and germ cells, 
gametes and embryos. Additionally, the pig is 
believed to be a good model for the human to 
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study the effect of chromosome aberrations in 
male and female meiosis. 

The 1990s witnessed an increasing inter¬ 
est in gene mapping in all farm animals, includ¬ 
ing pigs. Needless to say that a priori knowledge 
of the porcine chromosomes significantly con¬ 
tributed to enhancing this cause, making the 
porcine genome one of the best studied animal 
genomes by the mid-1990s. The physical and the 
meiotic linkage maps developed for individual 
pig chromosomes during the past two decades 
have played a significant role in understanding 
the organization and evolution of this genome, 
and finding genes and markers of economic 
significance in pigs. The recently launched 
porcine whole genome (WG) sequencing 
project does not mark the end of the era of 
chromosome maps. It is, rather, the opposite: 
good, integrated maps are prerequisites to vali¬ 
date and support the WG sequence assembly. 

©CAB International 2011. The Genetics of the Pig, 
2nd Edn (eds M.F. Rothschild and A. Ruvinsky) 
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Molecular hybridization-based methods, 
initially developed for physical gene mapping, 
have become an integral part of cytogenetic 
analysis, blurring the boundaries between clas¬ 
sical cytogenetics and molecular biology. 
Rapidly evolving array-based genome analysis 
tools further obscure this picture, bringing 
chromosome analysis to submicroscopic levels. 
Fig cytogenetics is no exception; it has stepped 
into the era of ‘new cytogenetics’ or ‘cytoge- 
nomics’ (Speicher and Carter, 2005). 

The present chapter aims to address the 
porcine genome through two main sections. 
The first of these, Pig Chromosomes, deals 
with classical and molecular cytogenetics , with 
the emphasis on the latter. After a brief back¬ 
ground to pig chromosomes, the reader will be 
introduced to the basic concepts of normal and 
abnormal pig chromosomes, and evolutionary 
aspect of the pig karyotype. The second main 
section, Gene Mapping, deals with the genera¬ 
tion of genetic linkage and physical gene 
waps in pigs - from the first mapped locus to 
high-resolution integrated WG maps. This 
information, in combination with the WG draft 
sequence, helps in developing a thorough 
understanding of the organization of the por¬ 
cine genome. 

Pig Chromosomes 

Methods for studying pig chromosomes 

During the past 15 years, an impressive series 
of technical and conceptual advancements 
have shaped human and animal cytogenetics, 
leading to improvement of the resolution and 
accuracy of chromosome studies (Trask, 2002; 
Speicher and Carter, 2005; Dobigny and 
Yang, 2008). Pig cytogenetics today is essen¬ 
tially a combination of conventional chromo¬ 
some analysis methods, such as chromosome 
banding, and a variety of sequencing-, PCR-, 
hybridization- and immunochemistry-based 
molecular approaches. Most of the molecular 
methods were originally developed for genome 
analysis and physical gene mapping, but were 
quickly adopted by cytogeneticists, and applied 
in many different fields of porcine chromo¬ 
some analysis. 


Classical chromosome banding methods 
and pig chromosome nomenclature 

For the past 40 years, various banding tech¬ 
niques, such as QFQ (quinacrine fluorescence 

(Q) -banding) (Caspersson et a/., 1970), GTG 
(Giemsa (G)-Trypsin-banding) (Seabright, 1971), 
RBG (reverse (R)-banding with Giemsa staining) 
(Dutrillaux and Lejeune, 1971) and RBA (reverse 

(R) -banding with acridine orange staining) 
(Dutrillaux, 1973), have been used to identify 
individual porcine chromosomes and arrange 
them into a karyotype. The first karyotype was 
based on Q-banding (Gustavsson et ah, 1972), 
followed by several GTG-banded karyotypes 
(Gustavsson, 1980) that were consolidated 
into the first international standard (Ford et a/., 
1980). The latter was soon revised by adding 
RBA-banding (Lin et a/., 1980; Ronne et a/., 
1987), schematic drawings of G- and R-banded 
chromosomes, and a band nomenclature system 
(Fig. 7.1) (Gustavsson, 1988). It was the first 
comprehensive chromosome nomenclature 
among domestic animals and enabled the 
description of normal karyotypes, aberrant 
patterns, and established the foundation for 
physical chromosome maps. 

The nucleolus organizer regions (NORs) 

The NORs are the chromosomal sites of 5.8S, 
18S and 28S ribosomal RNA (rRNA) genes 
and were originally detected by the Ag-I tech¬ 
nique or NOR-banding (Goodpasture and 
Bloom, 1975; Bloom and Goodpasture, 
1976). The method uses ammoniacal silver 
solution and stains the chromosomal sites con¬ 
taining actively transcribed rRNA genes. More 
recently, NORs have been studied using fluor¬ 
escence in situ hybridization (FISH) with 
ribosomal DNA (rDNA) probes that detect all 
loci of the rRNA genes, regardless of their 
transcriptional status. In pigs, NORs are located 
close to the centromeric regions of SSC8 and 
10 (Lin et a/., 1980; Toga-Piquet et a/., 1984; 
Mellink et ah, 1991). Additionally, a narrow 
intercalary site suggesting a small rRNA gene 
cluster is present on SSC16 (Bosma et ah, 
1991a). Another class of rRNA genes - the 5S 
rRNA genes - has been mapped by in situ 
hybridization (ISH) to SSC14q22 (Lomholt 
et ah, 1995; Mellink et ah, 1996). Numerical 
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Fig. 7.1. Schematic drawings of G-banded porcine chromosomes. Source: Gustavsson (Committee for 
the Standardized Karyotype of the Domestic Pig) (1988). 


variations and polymorphism of Ag-positive 
NORs have been demonstrated in pigs at the 
cellular, individual and population levels (Mellink 
et a/., 1992, 1994). The activity of rDNA 
is correlated with the concentration of oxidative 


agents present in the environment, and quanti¬ 
tation of active NORs can be used to measure 
oxidative stress in porcine cells (Wnuk et a/., 
2008). The genes encoding for all four 
classes of rRNA genes have also been mapped 
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in other suids/Suiformes - the babirusa and 
the white-lipped peccary (Zijlstra et a/., 1997) 
confirming the previously recognized chromo¬ 
somal homologies between the three species 
(Bosma et a/., 1996, 2004). 

Centromeric and telomeric heterochromatin 

Traditionally, regions containing centromeric 
and telomeric heterochromatin have been high¬ 
lighted by the C (centromere)-banding (Sumner, 

1972) or THA (terminal)-banding (Dutrillaux, 

1973) techniques. While C-banding stains 
only pig chromosome centromeres, the THA- 
method stains all centromeres and most of the 
telomeric regions (Gustavsson, 1983). 
Additionally, bright interstitial THA-bands are 
present on SSC2q, 6q and lOp. More recently, 
heterochromatic regions in the pig genome 
have been studied using molecular methods 
such as restriction enzyme digestion (Miller 
et a/., 1993; Adega et a/., 2005), FISH (de la 
Sena et a/., 1995) and primed in situ synthesis 
(PRINS) (Gu et a/., 1996; Rogel-Gaillard et a/., 
1997b; Wnuk et a/., 2008). Telomeric 
(TTAGGG) n repeats localize at the ends of all 
chromosomes, but also interstitially on 
SSC6q22, 6q, 7q, and at the centromeres of all 
acrocentric chromosomes (de la Sena et aI., 
1995; Wnuk et a/., 2008). Restriction enzyme 
analyses show the presence of two distinct cen¬ 
tromeric satellite DNA families (Adega et a/., 
2005), of which the Ac2 family shares some 
similarity with telomeric sequences (Wnuk 
et a/., 2008). Interestingly, the intercalary 
(TTAGGG) n site at 6q coincides with a bright 
THA-band (Gustavsson, 1983), while the 6q 
and 7q sites are at the junction of porcine seg¬ 
ments homologous to (the human segments) 
HSA1 and HSA19, and HSA6 and HSA15, 
respectively (Fronicke et aI., 1996). Overall, 
porcine constitutive heterochromatin is very 
heterogeneous, consists of chromosome-specific 
satellite DNA families and can be divided into 
22 heterochromatin subclasses. Of these, 12 
are centromeric, four interstitial, five telomeric 
and one specific to the long arm of the Y chro¬ 
mosome (Adega et a/., 2005). Polymorphism 
in the amount of centromeric heterochromatin 
has been described both in uniarmed (13-18) 
and biarmed (1, 2, 8, 11 and 12) chromosomes 
(Glahn-Luft et a/., 1982; Eldridge, 1985; 
Gustavsson, 1990). 


Chromosome analysis by fluorescence 
in situ hybridization, FISH 

ISH is widely used in different branches of 
biology; it relies on the Watson-Crick base¬ 
pairing complementarity principle, and permits 
the location of DNA markers in their original 
place or in situ (Raudsepp and Chowdhary, 
2008). The two major components of FISH are 
the probe and the target. In the case of cytoge¬ 
netics and gene mapping, the targets can be 
whole cells, mitotic or meiotic chromosome 
preparations at different stages of the cell cycle, 
DNA fibres (Trask, 2002; Speicher and Carter, 
2005; Raudsepp and Chowdhary, 2008; Rubes 
et a/., 2009) or DNA tiling-arrays (Shaffer and 
Bejjani, 2006; Sharp, 2009). The probes vary 
considerably in size and origin, e.g. from a few 
base pairs of long telomeric or centromeric 
repeats to composite DNA sequences from the 
whole chromosome or chromosomal segment. 
The probes are labelled directly with fluoro- 
chromes or indirectly with molecules that need 
binding by fluorochrome-conjugated antibodies 
to visualize the signals. Over the past 20 years, 
FISH methodology has improved tremendously 
(Nederlof et a/., 1989; Lichter, 1997; Trask, 
2002). Sophisticated imaging systems and 
the available number of spectrally distinct 
fluorochromes allow simultaneous detection of 
two, three or even multiple probes, and the 
overall sensitivity of FISH has increased about 
10,000 times. 

In regard to porcine chromosomes, FISH 
was initially used for physical gene mapping 
(see the second part of the chapter). However, 
during the past 10 years the method has also 
become an integral part of cytogenetic analysis 
(Ducos et a/., 2007, 2008; Rubes et a/., 2009). 
The availability of large insert clones from pig 
genomic libraries (Rubes et a/., 2009) and the 
development of flow-sorted and microdissected 
chromosome-specific probes (see below) have 
been instrumental in the use of FISH in both 
porcine cytogenetics and gene mapping. 

GENERATION OF CHROMOSOME-SPECIFIC PROBES BY 

flow sorting. Among various domesticated 
species, the pig is the most attractive animal for 
flow sorting individual chromosomes, primarily 
due to small chromosomal number, distinct 
shape and size, and variation in GC- and AT-base 
pair ratios. Flow sorting of porcine chromosomes 
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was first carried out in the 1980s (Grunwald 
et ai, 1986, 1989; Matsson et a/., 1986; 
Geffrotin et a/., 1987), and fairly distinct 
separation of the chromosomes from normal 
and translocated cell lines was obtained. 
However, it took a few more years and a dual 
laser fluorescence activated cell sorter (FACS) to 
resolve the pig chromosomes into 19-20 definite 
peaks (Dixon etai, 1992; Schmitz eta/., 1992). 
Chromosome painting (Lichter et a/., 1988; 
Pinkel et a/., 1988) was used to demonstrate 
that five of the 20 clusters observed in the 
bivariate flow karyotype of a male pig corresponded 
to chromosomes 1, 13, 18, X and Y (Langford 
et a/., 1992). This breakthrough was significant 
for the unambiguous correlation of individual 
peaks to particular chromosomes. Later, two 
independent studies (Langford et ai, 1993; Yerle 
etai, 1993) established complete correspondence 
of each of the cluster in the porcine bivariate flow 
karyotype with a specific chromosome, thus 
providing a standard’ flow karyotype for the pig. 
Whole chromosome paints (WCPs) obtained 
through flow cytometry have found good use in 
molecular cytogenetic analyses of various 
chromosomal abnormalities (Table 7.1). 


GENERATION OF CHROMOSOME-SPECIFIC PROBES BY 

microdissection. This is a convenient alter¬ 
native to flow sorting that allows the isolation 
of DNA from whole chromosomes, as well as 
from chromosome arms, segments or bands 
(Chaudhary et ai, 1998b; Chowdhary and 
Raudsepp, 2001). The technique involves 
manual(Meltzer etai, 1992; Guan etai, 1994) 
or laser-assisted (Kubickova et ai, 2002) 
microscopic scraping of the chromosomal 
regions of interest, followed by amplification of 
the microdissected DNA using DOP- or PARM- 
PCR (Telenius et ai, 1992; Milan et ai, 1993), 
or some other whole-genome amplification 
method (Sorensen et ai, 2007). The probes thus 
obtained are devoid of the impurities commonly 
associated with flow-sorted chromosomes. The 
amplified DNA is labelled and used as a painting 
probe for HSH experiments. 

Initially, the aim of chromosome microdis¬ 
section was the construction of chromosome- 
specific libraries for physical gene mapping. 
However, the possibility to microdissect any 
chromosomal segment of interest has made 
the method extremely useful for the identifica¬ 
tion of otherwise intractable chromosomal 


Table 7.1. The use of pig chromosome-specific painting probes for the study of chromosomal 
abnormalities (in chronological order). 


Aberration 

Painting probes 

Probe origin 

Reference 

rcp(7;15) 

SSC7, 15 

Flow sorted 

Konfortova etai., 1995 

rcp(6;13); rcp(11 ;16); 
rcp(6;16); rcp(13;17); 

SSC2, 3, 5, 6, SSC11, 

13, 14, SSC15, 16, 17 

Flow sorted 

Pinton etai., 1998 

rcp(6;14); rcp(3;5); 
rcp(2;14); rcp(15;17) 

39,XXY 

HSAX 

Oncor, Coatasome® 

Makinen etai., 1998 

rcp(4;12); rcp(1;7); 

SSC1,4, 6, 7, 12 

Flow sorted 

Pinton, A. etai., 2000 

rcp(1 ;6) 

rcp(4;6); rcp(2;6); 
rcp(5; 17); rcp(5;8); 

SSC2,3, 4, 5, 6, SSC7, 

8, 15, 17 

Flow sorted 

Ducos etai., 2007 

rcp(15;17); rcp(7;8); 
rcp5;8); rcp(3;15) 

rcp(7;18) 

All SSC chromosomes 

Microdissected 

Kubickova etai., 2002 

SSC4 pericentric 

SSC4p, 4q 

Microdissected 

Pinton etai., 2003 

inversion 

Structural aberrations 

SSC1,13 

Microdissected 

Rezacova etai., 2003 

rcp(3;15) and rcp(12;14) 

SSC3, 12, 14, 15 

Flow sorted (SSC3, 12, 
14); Microdissected 
SSC15 

Pinton etai., 2005 

rcp(Y;14) 

SSC14, SSCY 

Flow sorted 

Pinton etai., 2008 

rob(13; 17) 

SSC13, 17 

Flow sorted 

Pinton etai., 2009 


rep, reciprocal translocation; SSC, Sus scrota ; HSA, Homo sapiens-, rob, Robertsonian translocation. 
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rearrangements, such as minute marker chromo¬ 
somes and minute deletions and translocations. 
In pigs, painting probes produced by microdis¬ 
section has been used to study translocations 
(Kubickova et al., 2002; Rezacova et al., 2003; 
Finton et a/., 2005) and intrachromosomal rear¬ 
rangements (Pinton et al., 2003) (Table 7.1). 

sperm-fish. Sperm-FISH is carried out on 
decondensed sperm heads and has become a 
state-of-the-art technique to analyse the 
chromosomal constitution of sperm. In pigs, 
sperm-FISH has been used to validate the purity 
of flow cytometrically sorted boar sperm 
(Kawarasaki et a/., 2000; Parrilla et a/., 2003) 
but also to estimate the rate of aneuploidies in 
normal individuals (Rubes et a/., 1999), and 
analyse meiotic segregation in translocation 
and inversion carriers (Pinton et al., 2004; 
Massip et ai, 2008, 2009; Bonnet-Garnier 
et al., 2009). 

COMPARATIVE GENOMIC HYBRIDIZATION (CGH). CGH 

was originally designed to overcome the 
difficulties associated with preparing high-quality 
metaphase spreads from some types of cells, 
such as various solid tumours. The technique 
was developed by a team lead by Ollie and Anna 
Kallioniemi, Dan Pinkel and Joe Gray (Kallioniemi 
et al., 1992) and uses competitive hybridization 
of two differently labelled (red and green) DNA 
probes (e.g. DNA from a normal cell and from a 
cancer cell) to normal metaphase chromosomes. 
The ratio of red-to-green fluorescence is 
measured along each chromosome, and regions 
with deletions and amplifications are determined. 
In porcine cytogenetics, chromosome-CGH has 
been used in the detection of the loss of 
SSC13q36-q39 in swine cutaneous melanoma 
(Apiou et ai, 2004) and in investigating 
chromosome aneuploidies in early embryos 
(Hornak et ai, 2009). 

A new and promising refinement of the 
chromosome-CGH technology is array-CGH, 
in which metaphase chromosomes are replaced 
by an array of thousands of bacterial artificial 
chromosome (BAC) clones or short oligonucleo¬ 
tide sequences that form a tiling path over the 
entire genome or the genomic region of interest 
(Trask, 2002; Albertson and Pinkel, 2003; 
Speicher and Carter, 2005). It is anticipated 


that array-CGH will considerably improve the 
detection of disease-related complex chromo¬ 
somal rearrangements (Shaffer and Bejjani, 
2006; Lee et ai, 2007; Sharp, 2009). It is also 
believed that array-CGH will help to uncover 
the extent of naturally occurring structural varia¬ 
tions in genomes between normal individuals 
(Redon et al., 2006). Recently, array-CGH was 
for the first time used in pigs to get a snapshot 
of copy number variation (CNV) in the pig 
genome (Fadista et al., 2008). A custom tiling 
oligonucleotide array with a median probe spac¬ 
ing of 409 bp was designed for four porcine 
chromosomes, namely SSC4, 7, 14 and 17. 
Array-CGH analysis was carried out on 12 Duroc 
boar founders and one unrelated Hampshire 
boar, and 37 CNV regions were identified across 
the four chromosomes. Notably, some CNVs 
overlapped with known segmental duplications. 

Immunolocalization of chromosomal proteins 

The combined use of fluorescently labelled anti¬ 
bodies for the synaptonemal complex proteins 
SCP1 and SCP3, the mismatch repair protein 
MLH1 present at late recombination nodules 
(Barlow and Hulten, 1998), and the yH2AX 
protein that is associated with asynapsed chro¬ 
matin and meiotic silencing (Fernandez- 
Capetillo et al., 2003) has considerably 
improved knowledge about the behaviour of 
meiotic chromosomes in normal cells, and in 
cells with chromosomal aberrations in humans, 
mice and chickens (reviewed by Villagomez and 
Pinton, 2008). The pig is so far the only domes¬ 
tic animal in which fluorescence immunocyto- 
chemistry has been used to study chromosomally 
abnormal spermatocytes. In one study, the 
meiotic behaviour of rob(13;17) was analysed 
(Pinton et al., 2009), while another showed the 
extent of chromatin silencing in a case of Y;14 
translocation (Pinton et al., 2008). 

Chromosomal abnormalities - an 

overview 

As observed in other species, chromosome 
aberrations have also been found in pigs. These 
aberrations can be classified into two major cat¬ 
egories: numerical and structural. Chromosome 
abnormalities associated with abnormal sexual 
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development, and cytogenetic anomalies found 
in porcine germ cells, gametes and embryos are 
discussed separately at the end of this section. 

Numerical aberrations 

Chromosome euploidy, which is characterized 
by variation of the whole haploid complement, 
has been found only in porcine sperm, ova and 
zygotes (discussed below). In contrast, aneu- 
ploidy is associated with individual chromosomes 
and can involve either the autosomes or the sex 
chromosomes. 

sex chromosome aneuploidy. This is relatively 
infrequent in pigs and only a few cases have 
been described. Among these are 37,XO 
X-monosomy (Nes, 1968; Lojda, 1975), 
39,XXY and 40,XXXY Kleinfelter syndrome 
(Breeuwsma, 1968; Hancock and Daker, 1981; 
Gustavsson, 1984; Makinen et a/., 1998), and 
mosaic karyotypes such as 39,XXY/40,XXXY 
(Breeuwsma, 1970; Hancock and Daker, 1981) 
and 37,X/38,XY/39,XYY (Quilter et a/., 2003). 
Individuals with sex chromosome aneuploidy 
may phenotypically appear male, female or 
intersex, but always demonstrate some abnor¬ 
malities of the gonads or external genitalia. 

autosomal aneuploidy. Autosomal aneuploidy 
appears to be a non-existent phenomenon in 
live-born piglets. So far, only animals with 
mixoploid conditions, such as 37,XY-18/ 
38,XY/39,XY+18, 38,XY/37,XY-18 (Vogt 
et a/., 1974) and 38,XY/39,XX+14 (Bosch 
et a/., 1985), have been described. However, 
the condition exists at embryonic stages. 
Monosomy of SSC11 (Smith and Marlowe, 
1971) and double trisomy of SSC17 and 18 
(Ruzicska, 1968) were identified using traditional 
cytogenetic methods. Double trisomy of SSC1 
and 10 at a blastomere stage was analysed 
using FISH with porcine chromosome specific 
probes (Zudova et a/., 2003). 

Structural aberrations 

Structural chromosomal rearrangements are 
common in pigs and can be as frequently as 
0.47% (Ducos et a/., 2007). Many of these 
abnormalities, mainly reciprocal translocations, 
were discovered by I. Gustavsson and co-workers 


during the 1970s and 1980s. Since then, the 
world centre of porcine cytogenetics has shifted 
to INRA (Institut National de la Recherche 
Agronomique) in France, to the Cytogenetic 
Laboratory at the National Veterinary School 
of Toulouse. Thanks to nationally funded 
programmes, the laboratory carries out chro¬ 
mosome analyses for about 1500-2000 ani¬ 
mals per year, analysing mainly young purebred 
boars for artificial insemination (AI) centres 
(Ducos et a/., 2002, 2007, 2008). Over the 
period of 2002-2006 the laboratory has identi¬ 
fied over 66 new reciprocal and Robertsonian 
translocations and eight inversions (Ducos 
et a/., 2007, 2008). Systematic cytogenetic 
screening programmes of AI boars are also 
ongoing in Poland, the Netherlands and 
Hungary, and occasionally in Finland and 
Portugal, thus providing an additional input to 
the discovery of structural chromosomal abnor¬ 
malities in pigs (Ducos et a/., 2008). 

reciprocal (rcp) translocations. There are 
over 130 different types of rcp translocations 
known today (Table 7.2), and these are the most 
frequently reported chromosomal rearrange¬ 
ments in pigs. Analysis of the data indicates that 
the distribution of participating chromosomes is 
not random. Chromosomes 1, 7, 14 and 15 are 
involved most frequently, and SSC10, 12 18, X 
and Y the least frequently. Furthermore, some 
chromosomal bands are more prone to breakages 
than others; for example, band lq21 participates 
in rcp translocations with five different chromo¬ 
somes, and band 7q24 is more frequently 
involved than six other break-points on the same 
chromosome (Table 7.2). 

Irrespective of the chromosomes involved, 
rcp translocations result in the production of a 
variety of balanced and unbalanced gametes 
(McClintock, 1945; Ford and Clegg, 1969; 
King, 1980; Pinton et a/., 2005; Villagomez 
and Pinton, 2008; Villagomez et a/., 2008) 
which is considered to be the main reason for 
reduced fertility in translocation carriers 
(Gustavsson, 1990; Pinton, A. et a/., 2000; 
Villagomez et a/., 2008). Litters of such 
animals are 25-50% smaller, mainly because 
of the early loss of zygotes with chromosomal 
imbalances (Akesson and Henricson, 1972; 
Hageltorn et a/., 1976; King et a/., 1981; 
Popescu and Boscher, 1982; Gustavsson, 
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Table 7.2. Comprehensive summary of reciprocal (rep) translocations detected in pigs, showing the 
participating chromosomes, location of break-points, breeds involved and references. 


No. 

Rep translocation 

Breed 3 

Reference(s) 

1 

(1:3)(p;q) 

LW, RU 

Konovalov etal., 1987 

2 

(1 ;4)(q27;q21) 

P 

Ducos etal., 2007 

3 

(1;5)(p21;q21) 

Polish 

Danielak-Czech etal., 1997 

4 

(1 ;6)(p11 ;q11) 

LW, SWE 

Yang etal., 1992 

5 

(1 ;6)(p11 ;q35) 

LW 

Locniskar etal., 1976 

6 

(1 ;6)(q12;q22) 

GA 

Ducos etal., 1997a, 1998a 

7 

(1 ;7)(q17;q13) 

ni 

Ducos etal., 2007 

8 

(1 ;7)(q2.13;q24) 

LR, SWE 

Gustavsson etal., 1988 

9 

(1 ;8)(p13;q27) 

Y, SWE 

Gustavsson etal., 1982 

10 

(1;9)(p;p) 

LW 

Ducos etal., 1997a, 1998a 

11 

(1 ;10)(q2.11 ; P 15) 

ni 

Ravaoarimanana etal., 1992 

12 

(1;11)(p23;q15) 

LR, FIN, BL 

Kuokkanen and Makinen, 1988; 

Tzocheva, 1994 

13 

(1;11)(q;q) 

LR, SWE 

Flansen-Melander and Melander, 1970 

14 

(i; 11 )(qi i ;qi i) 

P 

Ducos etal., 2007 

15 

(i ;i 1 )(q24;p13) 

ni 

Ducos etal., 2007 

16 

(1 ;13)(q27;q41) 

DU 

Ducos etal., 2007 

17 

(1; 14)(p25;q15) 

Y, SWE 

Gustavsson, 1984 

18 

(1 ;14)(q17;q21) 

LW 

Tarocco etal., 1987 

19 

(1; 14)(q2.12;q22) 

ni 

Zhang etal., 1992 

20 

(1; 14)(q23;q21) 

SML 

Golish etal., 1982 

21 

(1 ;15)(p25;q13) 

LR, FIN 

Kuokkanen and Makinen, 1988 

22 

(1 ;15)(q17;q22) 

LS 

Ducos etal., 2007 

23 

(1;15)(q27;q26) 

LW 

Popescu etal., 1988 

24 

(1 ;16)(q11 ;q11) 

LR 

Forster etal., 1981 

25 

(1 ;17)(p11 ;qi 1) 

P 

Ducos etal., 2007 

26 

(1 ;17)(q21 ;q11) 

Y, SWE 

Gustavsson, 1984 

27 

(1 ;18)(q;q) 

H 

Villagomez etal., 1991 

28 

(2;4)(p17;q 11) 

ni 

Gustavsson etal., 1982 

29 

(2;6)(p17;q27) 

Y 

Ducos etal., 2002 

30 

(2;8)(p11 ;p13) 

LW 

Ducos etal., 2007 

31 

(2;9)(q13;q24) 

LW 

Ducos etal., 2007 

32 

(2;9;14)(q23;q22;q25) 

ni 

Makinen etal., 1997 

33 

(2; 14)(p14;q23) 

ni 

Villagomez, 1993 

34 

(2;14)(p15;q26) 

X 

Ducos etal., 2007 

35 

(2; 14)(q13;q27) 

LF 

Ducos etal., 1998b 

36 

(2; 14)(q21 ;q24) 

ni 

Ducos etal., 2007 

37 

(2; 15)(p13;q24) 

LR, FIN 

Makinen etal., 1987 

38 

(2;15)(q28;q24) 

ni 

Ducos etal., 2007 

39 

(2;16)(q28;q21) 

SE 

Ducos etal., 2007 

40 

(2; 17)(p12;q14) 

DU 

Ducos etal., 2007 

41 

(3;5)(p13;q23) 

LF 

Ducos etal., 1998b 

42 

(3;6) (p14;q21) 

ni 

Villagomez etal., 2008 

43 

(3;7)(p13;q21) 

LW 

Gabriel-Robez etal., 1988 

44 

(3;7)(p;q) 

Indian 

Ducos etal., 1997a 

45 

(3;8)(q25;p21) 

ni, LS 

Ducos etal., 2007 

46 

(3; 11)(q13;p11) 

LS 

Ducos etal., 2007 

47 

(3;13)(p15;q31) 

LW 

Ducos etal., 1998a 

48 

(3; 15)(q27 ;q13) 

LW 

Ducos etal., 2002 

49 

(3;16)(q23;q22) 

X 

Ducos etal., 2007 

50 

(4;5)(p13;q21) 

X 

Ducos etal., 2007 

51 

(4;6)(q21;p14) 

p 

Ducos etal., 2002 


Continued 
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Table 7.2. Continued. 


No. Rep translocation Breed 3 Reference(s) 


52 

(4;6)(q2l;q28) 

LW 

53 

(4; 12)(q21 ;q13) 

X 

54 

(4;13)(p15;q41) 

ni 

55 

(4; 13)(q25;q41) 

LR, FIN 

56 

(4; 14)(p11 ;q11) 

LWxLF 

57 

(4; 14)(q;q) 

Indian 

58 

(4;15)(q;q) 

P 

59 

(4;15)(q25;q11) 

LF 

60 

(4;16)(q25;q21) 

LS 

61 

(5;7)(q23;p11) 

SE 

62 

(5;8)(p11 ;p23) 

SE 

63 

(5;8)(p12;q21) 

LW 

64 

(5;8)(q12;q27) 

Y, SWE 

65 

(5,9)(p11 ;p24) 

LF 

66 

(5;9)(q21;p13) 

LF 

67 

(5; 14)(q11 ;q) 

H x P 

68 

(5,14)(q21 ;q12) 

DU 

69 

(5;15)(q25;q25) 

LR 

70 

(5; 17)(p12;q13) 

Y 

71 

(6;8)(q33;q26) 

GAx MS 

72 

(6;8)(p15;q27) 

+ (10; 18)(p11 ;q24) 

P 

73 

(6;13)(p15;q41) 

LF 

74 

(6;13)(p13;q49) 

LW 

75 

(6; 14)(p11 ;q11) 

LW x Essex 

76 

(6;14)(q27;q21) 

LW x P 

77 

(6;15)(p;q) 

LR, FIN 

78 

(6;15)(p15;q13) 

P x LW 

79 

(6; 16)(q11 ;q11) 

SML 

80 

(7 ;8)(q13;q27) 

ni 

81 

(7;8)(q24;p21) 

3/4LF,1/4MS 

82 

(7;9)(q11;q26) 

LW 

83 

(7;9)(q15;q15) 

LW 

84 

(7; 11 )(q21 ;q11) 

Y, SWE 

85 

(7;12)(q11 ; P 15) 

P 

86 

(7;12)(q24;q15) 

Y, FIN 

87 

(7; 13)(p13;q21) 

H 

88 

(7;13)(p13;q46) 

Polish 

89 

(7;14)(q15;q27) 

X 

90 

(7;14)(q26;q25) 

ni 

91 

(7; 15)(q24;q 12) 

LW 

92 

(7;15)(q24;q26) 

ni 

93 

(7;17)(q26;q11) 

H 

94 

(8; 10)(p11 ;q13) 

ni 

95 

(8; 12)(p11 ; P 11) 

ni 

96 

(8;13)(q27;q36) 

ni 

97 

(8;14)(p23;q27) 

ni 

98 

(8;14)(p21;q25) 

Polish 

99 

(9; 11 )(p24;q 11) 

Y, SWE 

100 

(9; 11 )(q14;p13) 

MS 

101 

(9; 14)(p14;q23) 

Polish 

102 

(9; 14)(p24;q 15) 

LS 

103 

(9;15)(p24;q13) 

LWx LF 


Ducos etal., 1998b 
Ducos et al., 2007 
Ducos et al., 2007 
Makinen and Remes, 1986 
Popescu et al., 1984 
Ducos et al., 1997a 
Popescu et al., 1988 
Ducos et al., 2007 
Ducos et al., 2007 
Ducos et al., 2007 
Ducos et al., 2002 
Ducos et al., 2002 
Gustavsson, 1984 
Ducos et al., 2007 
Ducos et al., 2007 
Popescu andTixier, 1984 
Ducos et al., 2007 
Parkanyi etal., 1992 
Ducos et al., 2002 
Bonneau et al., 1991 
Ducos et al., 2007 

Ducos et al., 1998b 
Ducos et al., 2007 
Madan et al., 1978 
Ducos et al., 1998a 
Bouters et al., 1974 
Bonneau et al., 1991 
Ducos et al., 1998a 
Ravaoarimanana etal., 1992 
Ducos et al., 2002 
Ducos et al., 2007 
Ducos et al., 2007 
Gustavsson et al., 1982 
Ducos et al., 2007 
Kuokkanen and Makinen, 1987 
Gustavsson etal., 1988 
Danielak-Czech etal., 1997 
Ducos et al., 2007 
Ducos et al., 2007 

Popescu etal., 1984; Konfortova et al., 1995 

Makinen et al., 1997 

Villagomez et al., 1995a 

Makinen et al., 1999 

Ducos et al., 2007 

Ravaoarimanana et al., 1992 

Ravaoarimanana etal., 1992 

Danielak-Czech etal., 1997 

Gustavsson et al., 1982 

Ducos et al., 2007 

Rejduch et al., 2003 

Ducos et al., 2007 

Ducos et al., 1998a Continued 
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Table 7.2. Continued. 


No. 

Rep translocation 

Breed 3 

Reference(s) 

104 

(9; 17) (p24;q23) 

X 

Ducos et ai, 2007 

105 

(10;11 )(q16;q13) 

P 

Ducos et at., 2007 

106 

(10; 13)(q11 ;q11) 

Polish 

Danielak-Czech etal., 1996 

107 

(10;13)(q13;q22) 

LS 

Ducos etal., 2007 

108 

(10; 13)(q16;q21) 

LW x P x DU x H 

Danielak-Czech and Slota, 2007 

109 

(10; 17)(q11 ;q21) 

ni, LW 

Ducos etal., 2007 

110 

(11 ;13)(q;q) 

LW 

Ducos etal., 1998a 

111 

(11 ;15)(p15;q13) 

LR, FIN 

Henricson and Backstrom, 1964 

112 

(11; 16)(p14;q14) 

LW x P 

Ducos etal., 1998a 

113 

(11 ;17)(p13;q21) 

SE 

Ducos etal., 2007 

114 

(12;13)(q13;q11) 

Minisib 

Astachova etal., 1988 

115 

(12,14)(q13;q15) 

DU 

Ducos etal., 2007 

116 

(12; 14)(q15;q13) 

P 

Ducos etal., 2007 

117 

(12; 15)(q;q) 

LW, RU 

Konovalov etal., 1987 

118 

(13; 14)(q21 ;q27) 

Y, SWE 

Hageltorn etal., 1976 

119 

(13;15)(q31;q26) 

P 

Ducos etal., 2007 

120 

(13; 16)(q41 ;q21) 

ni 

Ducos etal., 2007 

121 

(13;17)(q4l;q11) 

LF 

Ducos etal., 1998b 

122 

(14; 15)(q28;q13) 

SE, ni 

Ducos etal., 2007 

123 

(14;15)(q29;q24) 

ni, H 

Golish etal., 1982; Gustavsson and 

Jonsson, 1992 

124 

(14; 16)(q13;q21) 

P 

Ducos etal., 2007 

125 

(15;16)(q26;q21) 

Y 

Gustavsson etal., 1988 

126 

(15; 17)(q13;q21) 

LW 

Ducos etal., 1998a 

127 

(15; 17)(q24;q21) 

LF 

Ducos etal., 2002 

128 

(16; 17)(q23;q21) 

LR x DU 

Popescu and Boscher, 1986; Astachova 
etal., 1991 

129 

(17,18)(q21 ;q11) 

P 

Ducos etal., 2007 

130 

(X;13)(q24;q21) 

LR x Viet, H 

Gustavsson etal., 1989 

131 

(X: 14)(p;q) 

ni 

Singh etal., 1994; Neal 
etal., 1998 

132 

(Y;14)(q10;q11) 

DU 

Ducos etal., 2007 


a Breed abbreviations: BL, Belgian Landrace; DU, Duroc; FIN, Finnish; GA, Gascon; H, Hampshire; LF, French Landrace; 
LR, Landrace; LS, Synthetic line; LW, Large White; MS, Meishan; ni, not indicated; P, Pietrain; RU, Russian; SE, 
Sino-European; SML, Synthetic male line; SWE, Swedish; Viet, local breed from Vietnam; Y, Yorkshire; X, crossbred. 


1983; Gustavsson and Settergren, 1983). Notably, 
the phenotype and semen characteristics of 
translocation carriers appear normal, though 
in some cases degenerative changes in the tes¬ 
ticles have been found (Chandley et ai, 1972; 
Gustavsson et a/., 1989; Villagomez, 1993). 

Genome sequences and comparative 
maps of mammals have provided deeper 
insight into the molecular nature of chromo¬ 
somal break-points (Kemkemer et a/., 2009; 
Larkin et a/., 2009). Some break-points, such 
as evolutionary break-point regions (EBRs), are 
evolutionarily conserved and ‘reused’ across 
species. The EBRs are frequently associated 
with structural variations, SNPs (single nucleo¬ 


tide polymorphisms) and retrotransposons 
(Kemkemer et a/., 2009; Larkin et a/., 2009), 
but also with some common fragile sites and 
cancer break-points (Ruiz-Herrera et a/., 
2006). Correlation between the known trans¬ 
location break-points in pigs with mammalian 
EBRs is yet to be determined, though the 
human-pig Zoo-FISH map (Rettenberger 
et a/., 1995; Fronicke et a/., 1996; Goureau 
et a/., 1996) does not show any clear corre¬ 
spondence between the rep translocation and 
evolutionary synteny break-points. However, 
correlation has been found between the trans¬ 
location break-points and fragile sites in pigs 
(Riggs et ai, 1993; Ronne, 1995). 
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Robertsonian (rob) translocations. Robertsonian 
(rob) or centric-fusion translocations are 
relatively rare in pigs, and involve four of the 
six acrocentric porcine chromosomes, corre¬ 
sponding to rob(13/17), rob(14/15) and 
rob(14/17) (Ducos et a/., 2007). Among these, 
rob(13/17) is most frequent and has been 
described in normal pigs (Miyake et a/., 1977; 
Alonso and Cantu, 1982; Pinton etal., 2009), 
intersex animals (Masuda et a/., 1975) and 
malformed piglets (Miyake et a/., 1977). Like 
the carriers of rep translocations, animals with 
rob translocations demonstrate varying degrees 
of decrease in litter size (Schwerin et a/., 1986; 
McFeely et a/., 1988). Notably, the rate of 
unbalanced gametes in the case of rob(13/17) 
is higher in females (28.91%) than in males 
(3.21%) (Pinton et a/., 2009). Besides affecting 
fertility in domestic pigs, rob translocations 
have played an important role in chromosome 
evolution among Suidae species (see Table 7.4, 

pp. 148-149). 

duplications and deletions. Duplications and 
deletions of whole chromosomes or chromosome 
arms, mainly caused by reciprocal translocations 
and inversions, are rarely observed in live pigs 
or piglets. This is probably because embryos 
fertilized by such games will suffer from lethal 
overdoses or underdoses of many genes. The 
few cases described so far are: deletion of a 
whole arm of a medium-sized bi-armed 
chromosome in 10-day-old embryos of cytoge¬ 
netically normal parents (McFeely, 1966), and 
monosomy of SSC17 and double trisomy in 


piglets from an rep translocation carrier 
(Villagomez et ai, 1995a). All piglets, except 
one, died within a day of birth. The survivor 
with the double trisomy had normal body 
conformation and testis size, though 17.5% of 
the spermatozoa of this boar demonstrated 
acrosomal defects (Villagomez et a/., 1995b). 

inversions. Inversions have been described for 
only six porcine chromosomes, namely SSC1, 
2, 4, 6, 8 and 9 (Ducos et a/., 1997b, 2008; 
Massip et a/., 2009) (Table 7.3). Recently, the 

first study of the meiotic segregation pattern of 
inversions in pigs was carried out using FISH 
with telomeric probes on condensed sperm 
nuclei (Massip et a/., 2009). The estimated 
proportion of recombinant gametes was very 
low for all inversions studied, and no correlation 
was found between the size of the inverted 
fragment and the proportion of recombinant 
gametes. The authors conclude that inversions 
have very little impact on the reproductive 
performance in carrier pigs, which explains 
why so few cases have been detected so far. 

fragile sites (fs). FS are heritable chromo¬ 
somal loci prone to breakage under in vitro 
induction, and have been relatively well studied in 
pigs (Riggs and Ronne, 2009). So far, 11 
aphidicolin-induced (Riggs et ai, 1993), 24 
folate-sensitive (Yang and Long, 1993) and 25 
BrdU (bromodeoxyuridine)-induced (Ronne, 1995) 
FS have been identified, and a composite karyotype 
showing the location of 60 different FS has been 
generated (Ronne, 1995). As discussed above, 


Table 7.3. Summary of pericentric and paracentric chromosome inversions found in pigs. 


Chromosomes involved Inversion type Reference(s) 


inv(1)(p21;q2.10) 

Pericentric 

inv(1)(p22;q11) 

Pericentric 

inv(1)(p24;q29) 

Pericentric 

inv(1)(q12;q24) 

Paracentric 

inv(1)(q18;q24) 

Paracentric 

inv(2)(p11 ;q11) 

Pericentric 

inv(2)(p11 ;q21) 

Pericentric 

inv(2)(p13;q12) 

Pericentric 

inv(2)(q13;q25) 

Paracentric 

inv(4)(p14;q23) 

Pericentric 

inv(6)(p14;q12) 

Pericentric 

inv(8)(p11 ;q25) 

Pericentric 

inv(8)(p21 ;q11) 

Pericentric 

inv(9)(p12;p22) 

Paracentric 


Massip etal., 2009 

Danielak-Czech etal., 1996 

Ducos etal., 2008; Massip etal., 2009 

Massip etal., 2009 

Ducos etal., 2008 

Massip etal., 2009 

Ducos etal., 2008; Massip etal., 2009 
Ducos etal., 2008 

Ducos etal., 2008; Massip etal., 2009 

Ducos etal., 1997b; Pinton etal., 2003 

Ducos etal., 2008 

Ducos etal., 2008 

Ducos etal., 2008 

Pinton etal., 2002 
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some FS coincide with reciprocal translocation 
break-points (Riggs et a/., 1993; Ronne, 1995; 
Riggs and Ronne, 2009). Comparative analysis 
has identified eight putative orthologous FS 
between humans and pigs on SSClp21, lp25, 
2q23, llql2, 16q21, 18p21, Xp21 and Xq21 
(Riggs and Ronne, 2009). Of these, the FS on 
SSCllql2 (HSA13q21) has been conserved in 
mammalian evolution and is shared with 
corresponding sites in river buffaloes, cattle, 
horses and rabbits (Riggs and Ronne, 2009). 
Interestingly, all eight orthologous FS are located 
in the middle of human-pig synteny segments 
(Fronicke et a/., 1996) and are not associated 
with evolutionary synteny break-points. 


Chromosome abnormalities associated with 
abnormal sexual development and infertility 

intersexuality and sex reversal. These are 
relatively common conditions in pigs, with a 
frequency ranging from 0.1 to 0.6% in porcine 
populations (Pailhoux et a/., 1994; Pinton 
et a/., 2002). Such animals are usually infertile 
(Bosch et a/., 1985), can have testes, ovaries 
or ovotestes (Gustavsson, 1990), and most 
have a normal female 38,XX karyotype 
(Miyake, 1973; Hunter, 1996; Hunter and 
Greve, 1996; Pailhoux et a/., 1997, 2001a,b; 
Villagomez et a/., 2009) with no SRY gene or 
any other Y chromosome markers (Thomsen 
and Poulsen, 1993; Switonski et a/., 2002). 
Three intersex cases, however, have been 
associated with structural rearrangements on 
one homologue of SSC9p. One animal had a 
break (Tambasco et a/., 1990) and the other 
two a paracentric inversion inv(9)(pl2;p22) 
(Pinton et a/., 2002). Using FISH with SSC9p 
markers, the inversion break-point was 
narrowed down to 76.5cR (Pinton et a/., 
2002). The genetic causes of the abnormal 
phenotype, though, remain unknown, because 
the homologous region on HSAllq23 in 
humans contains no genes associated with 
intersex or sex reversal. While the genetic 
background to 3 8, XX intersexuality/sex 
reversal in pigs remains obscure, there is 
evidence that the condition may be inherited 
(Sittmann et a/., 1980; Pailhoux et a/., 1997). 
This is in line with the studies of exotic Vanuatu 
sacred intersex pigs where the trait is of 
maternal inheritance (Lum et a/., 2006). 


chimerism. Chimerism in blood leucocytes or in 
the whole body is either rare or has not been 
detected in pigs. This is probably because all 
males and the majority of females with XX/XY 
leucochimerism are phenotypically normal 
(Bruere et a/., 1968; Somlev et a/., 1970; 
Toyama, 1974; Christensen and Nielsen, 1980; 
Clarkson et a/., 1995; Padula, 2005). Some 
chimeric females, however, have inguinal hernia 
and ovarian aplasia (Bosma et a/., 1975). A few 
cases of XX/XY (Basrur and Kanagawa, 1971) 
and XX/XXY (Toyama, 1974) whole body 
chimerism have also been described. 

Cytogenetics of gametes and embryos 

In domestic pigs, where fertility is the trait of 
particular interest, a considerable amount of 
cytogenetic research is focused on the analysis 
of chromosomes in the prophase of male meio- 
sis and in sperm, oocytes and embryos (King, 
2008; Villagomez and Pinton, 2008). Porcine 
gamete and embryo cytogenetics has gained 
particular importance with the development of 
oocyte in vitro maturation and fertilization, and 
somatic cell nuclear cloning. Improved molecu¬ 
lar methods, such as fluorescence protein 
immunostaining and FISH, have considerably 
improved the accuracy and quality of these 
studies in pigs (Pinton et al., 2008, 2009). 

SYNAPTONEMAL COMPLEX (SC) ANALYSIS. SC 

analysis of the sperm of boars with 
translocations has been instrumental in 
understanding how structurally abnormal 
chromosomes pair in meiotic prophase, and 
how unbalanced gametes are produced 
(Gustavsson et a/., 1988, 1989; Villagomez, 
1993; Koykul et al., 2000; Villagomez and 
Pinton, 2008; Villagomez et al., 2008; Pinton 
et al., 2009). An additional outcome of this 
work has been the preparation of an SC 
karyotype of boar spermatocytes (Villagomez, 
1993) (Fig. 7.2). An unusual, but relatively 
frequently observed, feature of rep translocation 
carrier pigs is the phenomenon of early 
heterosynapsis, in which SCs are formed 
between non-homologous chromosomes or 
chromosome segments without previous 
homosynapsis at the early stages of pachytene 
(Villagomez and Pinton, 2008; Villagomez 
et al., 2008). Early heterosynapsis is also 
common in sex chromosome pairing in boar 
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Fig. 7.2. Synaptonemal complex (SC) karyotype 
of the domestic pig. The bivalents have been 
arranged according to the system used for 
mitotic chromosomes (Villagomez, 1993). Bar 
indicates 5 pm. 

spermatocytes (Villagomez, 1993), and is 
believed to alleviate apoptosis, which is 
otherwise a common consequence of 
chromosome pairing failure in meiosis (Koykul 
et a/., 2000; Villagomez and Pinton, 2008; 
Villagomez et a/., 2008). Studies in humans 
suggest that rep translocations might also 
induce interchromosomal effects (ICE) by 
interfering with the pairing of other 
chromosomes, leading to disjunction and 
aneuploidy (Blanco et a/., 2000). Putative ICEs 


were recently analysed in rcp(3;15)(q27;ql3) 
and rcp(12;14)(ql3;q21) in boars using sperm- 
FISH with painting probes for SSC1, 20, 11, 
13, 18, X and Y (Bonnet-Garnier et a/., 2009). 
Except for SSC1 in the case of rcp(3;15), no 
other ICEs were detected. Overall, it is 
suggested that as balanced constitutional rep 
translocations are the most common structural 
chromosomal rearrangements in humans and 
pigs, the pig is a good model to investigate the 
consequences of such rearrangements on 
meiotic segregation in both male and female 
humans (Pinton et a/., 2005; Massip et a/., 
2008; Bonnet-Garnier et a/., 2009). 

CYTOGENETIC ANALYSIS OF MATURE SPERM. This 

has been carried out using the zona-free 
hamster oocyte assay (Bird and Houghton, 
1990) and, more recently, sperm-FISH (see 
above) with pig chromosome specific painting 
probes (Rubes et a/., 1999; Kawarasaki 
et a/., 2000; Parrilla et a/., 2003; Pinton et a/., 
2004, 2005; Massip et a/., 2008). Notably, 
the frequency of structural aberrations in 
porcine sperm is about 7.1%, and exceeds that 
of numerical abnormalities, which is about 
4.4% (Bird and Houghton, 1990). 

CYTOGENETIC ANALYSIS OF PIG OOCYTES AND WHOLE 

pre-implantation embryos. The cytogenetic 
analysis of pig oocytes and whole pre-implantation 
embryos dates back almost 40 years to when the 
techniques for the preparation of chromosomes 
from these cell types were developed (McFeely, 
1967; McGaughey and Polge, 1971). Despite 
the early start, data on the occurrence of 
chromosome abnormalities in porcine oocytes 
and embryos are limited (Sosnowski etai, 2003; 
Zudova et a/., 2003; Lechniak et a/., 2007; 
Malekinejad et a/., 2007; Boulanger et a/., 
2008). The subject, however, is of importance 
because the quality of in vitro produced embryos 
is low and this is partially attributed to the 
increased frequency of chromosome abnor¬ 
malities in in vitro matured oocytes (Nagai et a/., 
2006). Recent studies show that the frequency of 
diploidy and aneuploidies in secondary oocytes is 
correlated with the duration of maturation and 
the age of the animals. For example, over 15% 
of oocytes are diploid after maturation for 40 h, 
while only 9% are diploid when in vitro 
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maturation lasts 30-36 h (Sosnowski et ai, 
2003). Further, oocytes of gilts have over eight 
times more aneuploidies (10.8%) than the 
oocytes of sows (1.3%) (Lechniak eta/., 2007). 
The chromosome most frequently involved in 
aneuploidies is SSC10, suggesting that, as in 
humans, non-disjunction more frequently involves 
small chromosomes. 

The percentage of chromosomally abnor¬ 
mal in vivo produced morulae/blastocysts in 
pigs is about 11% (Zudova et ai, 2003), while 
the overall incidence of embryos with abnormal 
chromosomal make-up is 5% (McFeely, 1967; 
King, 2008). Recent CGH analysis detected 
aneuploidies in 14.3% of 77 pig embryos stud¬ 
ied (Hornak et a/., 2009). The chromosomes 
most frequently involved were SSC8, 11, 12, 
13, 17 and X, while aneuploidies of SSC 2, 9 
and 18 were rare. Interestingly, aneuploidy of 
SSC 10, which is frequently detected in oocytes 
(Lechniak et a/., 2007), was not found in 
embryos. The studies also suggest that there 
are specific stages of development that are 
more sensitive to chromosomal imbalance than 
others. For example, all abnormalities related 
to rcp(13;14) and X-autosome translocation 
are eliminated during the first 21 days of devel¬ 
opment (Neal et a/., 1998; King, 2008). 

Finally, porcine sperm, oocytes and zygotes 
are the only cells where numerical variation of 
the whole haploid complement has been found 
(Hancock, 1959; Thibault, 1959; Bomsel- 
Helmreich, 1961; McFeely, 1966; Hunter, 
1967; Moon et a/., 1975; Dolch and Chrisman, 
1981; McCauley et a/., 2003; Zudova et a/., 
2003). The described cases involve: (i) the pres¬ 
ence of more than two pronuclei in the ovum; 
(ii) the occurrence of heteroploid, triploid or 
mosaic embryos; (iii) the presence of XXXXY or 
XXXY sex chromosomes in tetraploid zygotes; 
and (iv) the detection of a few tetraploid, triploid 
or diploid/triploid 10-day-old blastocysts. 
Factors such as polyspermy, polygyny or sup¬ 
pression of the first cleavage division are among 
the reasons attributed to these abnormal chro¬ 
mosome numbers. 

Cytogenetics of cloned pigs 

Compared with the limited number of cytogen¬ 
etic studies of porcine oocytes and embryos, 
even less is known about the chromosomes 


of cloned pigs. The need for such analysis, how¬ 
ever, exists because chromosomal abnormalities 
have been found in 60-100% of the cells that 
were genetically modified to be used for somatic 
cell nuclear transfer (SCNT) (Mir et a/., 2003). 
Therefore, part of the inefficiency of nuclear 
cloning, in which only 10% of embryos survive 
to term (Heyman et a/., 2002; Campbell et a/., 
2007), can be attributed to chromosome aber¬ 
rations (King, 2008). Telomere shortening, as 
shown in cattle and sheep (Shiels and Jardine, 
2003; Alexander et a/., 2007), is another 
genetic consequence of cloning by SCNT. 
Interestingly, just the opposite tendency has 
been observed in pigs. The telomeres of cloned 
animals are either the same size or even longer 
than the telomeres in the donor cells or in age- 
matched controls (Jeon et ai, 2005; Kurome 
et ai, 2008; Tian et ai, 2009). It has been 

proposed that the restoration or enhancement 
of telomere length in cloned pigs is due to higher 
telomerase activity in porcine SCNT blastocysts 
than in nuclear donor cells and in vitro fertiliza¬ 
tion derived blastocysts (Jeon et ai, 2005). 

Comparative cytogenetics and pig 
karyotype evolution 

The advent of banding techniques during the early 
1970s facilitated chromosome identification 
but also provided the possibility of comparing 
chromosomes across closely and distantly 
related species. Comparison of the karyotypes 
of 15 living Suidae species (Table 7.4) supports 
the idea that karyotype evolution within this 
genus has been significantly influenced by cen¬ 
tric fusions (Gustavsson, 1990; Adega et ai, 
2006). Furthermore, centric fusion polymorph¬ 
isms in populations of wild/feral pigs have 
been observed in Asia, Europe (Tikhonov and 
Troshina, 1974, 1975) and North America 
(our unpublished data; Table 7.4). Interestingly, 
in different geographical areas, different types 
of polymorphisms exist. In Kyrgyzstani boars the 
centric fusion involves SSC 15 and 16, while in 
European (Tikhonov and Troshina, 1978; 
Troshina et ai, 1985) and North American 
animals (our unpublished data) it is predomi¬ 
nantly rob(15;17). The translocation reduces 
the chromosome number to 37 in heterozy¬ 
gotes (Fig. 7.3) and to 36 in homozygotes. 



Table 7.4. A summary of the current status of cytogenetics in the family Suidae. The evolutionary systematics of the family is derived from Lucchini et al. (2005), 
Robins et al. (2006) and Chen etal. (2007). 


Chromosome 


Genus 

Species 

Common name 

no. (2 n) 

Comments 

Reference(s) 

Babyrousa (babirusas) 

Babyrousa 

babyrussa 

Babirusa 

38 

11 autosomes and X identical to Sus 
scrota ; 5 autosomes with no direct 
equivalents; Y is aero with distinct p 
arm 

Bosma, 1980; Bosma and 

De Haan, 1981; Bosma 
etal., 1996; O’Brien, 2006 


Babyrousa 

babyrussa 

celebensis 

Sulawesi babirusa 

38 

Similar to babirusa 

O’Brien, 2006 

Phacochoerus (warthogs) 

Phacochoerus 

aethiopicus 

Desert warthog 

34 

14 autosomes and sex chromosomes 
similar to S. scrota ; Robertsonian 
translocation of chromosomes 13-16 
and 15-17 

Bosma, 1978; Melanderand 
Hansen-Melander, 1980 


Phacochoerus 

africanus 

sundervallii 

Southern warthog 

34 

Similar to desert warthog 

O’Brien, 2006 

Potamochoerus (African 
forest pigs) 

Potamochoerus 

porcus 

Red river hog 

34 

12 autosomes meta/submeta; 4 acros; X 
the largest and Y the smallest 
submeta 

Melander and Hansen- 
Melander, 1980; Bosma 
etal., 1991a; O’Brien, 2006 


Potamochoerus 

larvatus 

African bush pig 

34 

Similar to red river hog 

O’Brien, 2006 

Hylochoerus 

Hylochoerus 

meinertzhageni 

Giant forest hog 

32 

All autosomes and X are meta/submeta; 

Y is not analysed 

Melander and Hansen- 
Melander, 1980 

Sus (Eurasian swine) 

Sus scrota 

European wild pig 

36-38 

Rob(15/17) polymorphism 

McFee et al., 1966; Gropp 
etal., 1969; 

Rittmannsperger, 1971; 
Gustavsson etal., 1973; 
Tikhonov and Troshina, 
1975; Bosma, 1976; 
Popescu et al., 1980; 
O’Brien, 2006 
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S. scrota 

Asian wild pig 

36-38 

S. s. domestica 

Domestic pig 

38 

S. s. leucomystax 

Japanese 
wild pig 

38 

Sus porcula 
salvanius 

Pygmy hog 

38 

Sus verrucosus 

Javan warty pig 

38 

Sus barbatus 

Bearded pig 

38 

Sus celebensis 

Sus cebifrons 

Sulawesi warty 

pig 

Visayan warty pig 

38 

34 


Sus cebifrons 


34 


Rob(16/17) polymorphism 


2n = 38, the same in all breeds 
Karyotype identical to domestic pig 

Karyotype similar to S. scrota, centro- 
meric regions of acros have an extra 
band 

Karyotype similar to S. scrota except for 
chromosomes 10 and Y 
Karyotype similar to S. scrota 

Karyotype similar to S. scrota except Y 

14 meta/submeta, 2 acros 




Tikhonov and Troshina, 
1975;Troshina et al., 
1985; Liu etal., 2003 
Bosma et al., 1991a,b 
Muramoto etal., 1965; 
Okamoto et at, 1981; 
Bosma et at, 1991a,b 
Bosma et at, 1983 


Bosma et at, 1991a 

Bosma et at, 1991a; 

O’Brien, 2006 
Bosma et at, 1991a 

O’Brien, 2006 
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Fig. 7.3. Giemsa-stained karyotype (2 n = 37) of a female feral pig (from Virginia, USA) carrying a 
heterozygous translocation rob(15/17, arrow). Source: authors’ unpublished results. 


No information is as yet available for the 
chromosomes of two babirusa and three pig 
species, namely Bola Batu babirusa (. Babyrousa 
bolabatuensis), Malenge babirusa (Babyrousa 
togeanensis), Heude’s pig ( Sus bucculen- 
tus), Philippine warty pig ( Sus philippensis) 
and Oliver’s warty pig ( Sus philippensis 
oliveri). 

Finally, it is worth mentioning that com¬ 
pared with human, mouse and several other 
mammalian species, almost no G-band simi¬ 
larity has been detected between human and 
pig chromosomes (Grafodatskii and Biltueva, 
1987). The only region of putative homology 
was found between HSA2 and the distal 
part of SSClq. This, however, as shown later 
by Zoo-FISH (Fronicke et a/., 1996; Goureau 
et a/., 1996), turned out to be a false 
homology. 


Gene Mapping 

Gene mapping studies in pigs date back to 
almost five decades ago, when Andresen 
(Andresen, 1963) initiated the basic study of 
pig blood groups with the initial linkage maps. 
The formal beginning of physical gene map¬ 
ping in pigs falls into the early 1980s, and 
was inspired by the introduction of somatic 
cell hybrid technology (Goss and Harris, 
1975). The 1990s added radiation hybrid 
panels and ISH to the genome mapping ‘tool¬ 
kit’, which tremendously increased the quan¬ 
tity and quality of the maps generated during 
the last two decades. The development of 
resources such as cDNA (complementary 
DNA) and BAC genomic libraries (Rubes 
et a/., 2009) has essentially facilitated the 
development of markers for the construction 
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of cytogenetic, radiation hybrid (Faraut et a/., 
2009) and high-resolution BAC contig (con¬ 
tiguous) maps (Rogatcheva et a/., 2008). 
Today, over 10,000 loci have been mapped 
in the pig genome, using both linkage map¬ 
ping and different physical mapping 
approaches (Rogatcheva et a/., 2008; Faraut 
et a/., 2009). Of these, more than a half are 
aligned with the human genome sequence 
and establish a comparative framework 
between the two genomes. Most importantly, 
integrated physical, meiotic and comparative 
maps are critical for the assembly and 
annotation of the ultimate map of the por¬ 
cine genome - the whole genome sequence. 


Meiotic or genetic linkage maps 

Since the first linkage map, built by A. Sturtevant 
in T. Morgan’s laboratory nearly 100 years 
ago, the construction of meiotic or linkage 
maps has become an essential genetic proce¬ 
dure (Archibald and Flaley, 1998; Moran and 
James, 2005). Genetic linkage was initially 
revealed as a deviation from Mendel’s predic¬ 
tions on independent assortment. Genes that 
are located close to each other on the same 
chromosome do not assort independently in 
meiosis, which is explained by the linkage. The 
exchanges or crossovers between homologous 
chromosomes, which occur at meiosis during 
the formation of the gametes, break the link¬ 
age with a certain frequency. The proportion 
of recombinant genotypes is a measure of the 
crossing-over frequency. In general, the further 
apart two loci are on a chromosome the greater 
the chance that a crossover event will have 
taken place between them, and so the greater 
will be the proportion of recombinants. Hence, 
the recombination rate can be used for meas¬ 
uring the distance between two loci on a chro¬ 
mosome. There are two important requirements 
for the basic linkage mapping: large pedigrees, 
in which the relationships are known, and the 
availability of polymorphic genetic loci. Both 
these requirements were satisfied for the pig 
beginning in the early 1990s, and modern 
genomic tools now provide practically endless 
numbers of the polymorphic loci (Chapter 5). 
Physical distances between loci on DNA remain 


constant, and can be expressed in the number 
of nucleotides or other common metrics. In 
contrast, linkage between two genes or mark¬ 
ers always varies depending on the type of 
cross, genotype, region of a chromosome, sex 
and other factors. Despite this well known ‘vol¬ 
atility’ in measuring recombination distances 
between loci, linkage maps remain a unique 
tool in genetic research and selection even in 
the post-genomic era. While physical/genomic 
maps allow the highest possible accuracy 
(Chapter 8), linkage maps provide valuable 
functional information which can be used 
widely, as described later. 

During the 1980s and 1990s, significant 
efforts were made to develop linkage maps for 
humans and other species, including the pig. 
The difficulties in building a linkage map of the 
pig are usually compounded by a lack of know¬ 
ledge of the relative position of alleles on homol¬ 
ogous chromosomes, known as the phase. The 
major solution to this problem was calculating 
the likelihood ratio, which takes into account 
alternative phases. This procedure can be quite 
complex, particularly with large and compli¬ 
cated pedigrees. Fortunately several computer 
programs were developed, including linkage 
(Lathrop and Lalouel, 1988) and cri-map (Green 
et a/.-1990), which are capable of resolving 
these problems in most cases. The theoretical 
solution of these problems and the computer 
programs were major advances, which eventu¬ 
ally led to the construction of multi-locus linkage 
maps. For the pig, such linkage maps were built 
by the mid-1990s, and initially included nearly 
all the microsatellites available at that time (for 
details see Archibald and Haley, 1998). The 
latest porcine linkage map includes over 5000 
loci, hundreds of which are known genes (www. 
thearkdb.org). 

The measurement of the total length of 
sex-averaged linkage maps of all porcine auto- 
somes plus the X chromosome made by Rohrer 
et al. (1996) was 2286.3cM. The estimate of 
the complete genome length according to these 
authors is about 2470 cM, which is rather close 
to the observed values. Similar estimates were 
independently made by Marklund et al. (1996). 
Based on these data, Archibald and Haley 
(1998) assumed that the complete length of 
porcine linkage maps is around 2500cM. This 
is roughly equivalent to ~50 meiotic chiasmata 
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over the porcine genome, as one chiasma 
approximately corresponds to 50 cM. The latest 
available measurement of porcine linkage maps 
gives a sex-averaged estimate of 1711.8 cM 
(Vingborg et a/., 2009). However, the lengths 
of several linkage groups in this study were 
significantly lower than the minimal possible 
value of 50 cM. As in many other mammalian 
species, there is a significant difference in 
recombination rate between males and females. 
According to measurements by Marklund et al. 
(1996), the average ratio of female to male 
recombination was estimated at 1.4:1, but this 
parameter varied between chromosomes as well 
as between regions within chromosomes. A 
similar ratio of 1.6:1 was observed by Vingborg 
et al. (2009). The true causes of distinct recom¬ 
bination rates in females and males, as well as 
differences between chromosome regions, 
remain unclear (Moran and James, 2005). 

Linkage maps, besides their significant 
theoretical value in several fields of genetics, 
are essential for locating quantitative trait loci 
(QTLs), and can be used in marker-assisted 
selection (MAS, see Chapter 16). During the 
last 10-15 years, the underlying genetic archi¬ 
tecture of critically important porcine traits 
such as growth, litter size, disease resistance, 
meat quality and behavioural characteristics 
has become less hidden. Further study of QTLs 
and their interactions will continue to be of pri¬ 
mary interest. However, a link between pheno¬ 
type and genotype for quantitative traits is 
usually not very strong, as the genes involved 
in the development of these traits do not have 
large effects; there may also be a significant 
influence of environmental factors and devel¬ 
opmental randomness (Ruvinsky, 2009). 

Nevertheless, tracking the inheritance of 
markers in populations whose performance is 
recorded should allow some of the QTLs to be 
identified and the architecture of the genetic 
control of production traits to be at least par¬ 
tially determined. The general principle of this 
approach is simple; as soon as significant asso¬ 
ciations between the inheritance of a particular 
chromosomal region (as determined by marker 
inheritance) and trait variation are detected in a 
sufficiently large population, this suggests the exist¬ 
ence of a gene or genes affecting the traits in 
question. Efforts of numerous research groups, 
and particularly from Iowa State University, led 


to the creation of a QTL database for different 
agricultural animals, including the pig (http:// 
www. genome. iastate. edu/QTLdb/ notes. php). 
Once a QTL has been mapped to an interval 
between two arbitrary markers, there is a need 
to identify markers which are as close to that 
QTL as possible. These tightly linked markers 
are very rarely involved in meiotic recombina¬ 
tion and will continue to frame the QTL for a 
long time. As outlined in this and the following 
chapters, syntenic or linkage relationships over 
short distances (<3cM) are often conserved 
across species, and the pig has a typical mam¬ 
malian genome in this regard. Now that the 
sequence of the pig genome is known, the 
genes in the QTL region can be examined for 
causative mutations. 


Somatic cell hybrid (SCH) panels 
and synteny mapping 

Somatic cell genetics (Kucherlapati and Ruddle, 
1975; Ruddle, 1981; Kao, 1983; Faraut et al., 
2009) has been the key to the development of 
the hybrid cell panels commonly used for gen¬ 
erating synteny maps. Over the years, very 
little has changed in terms of the approaches 
used to develop these panels in different 
species. 

As in most mammalian species, SCH 
genetics in pigs began with the mapping of 
enzyme genes, first to the X chromosome 
(Forster, 1980; Forster et al., 1980; Leong 
et al., 1983) and then to the autosomes 
(Ryttman et al., 1986, 1988). The introduc¬ 
tion of PCR revolutionized porcine SCH analy¬ 
sis, and, in 1998, synteny mapping of more 
than 228 loci was reported (Wintero et al., 
1996; Yerle et al., 1996; Fridolfsson et al., 
1997). In the following few years, this figure 
increased more than threefold (Lahbib-Mansais 
et al., 1999, 2000; Davoli et al., 2000). The 
development of expressed sequence tags (ESTs) 
from porcine cDNA libraries and of sequence 
tagged sites (STSs) from BAC end sequences 
provided hundreds of new markers for synteny 
mapping in the early 2000s (Maak et al., 
2001; Tosser-Klopp et al., 2001; Cirera 
et al., 2003; Robic et al., 2003; Demeure 
et al., 2005). The ‘popularity’ of porcine 
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synteny mapping in the era when much more 
efficient and precise physical mapping 
approaches are available is probably because 
two porcine SCH panels (Rettenberger et al., 
1994a,b; Robic et al., 1996; Yerle et al., 1996) 
are so well characterized that they enable the 
accurate assignment of markers either to a 
chromosomal arm or even to specific bands. 
The third panel (Zijlstra et al., 1996) is also 
fairly useful, although it is not completely 
informative for some chromosome pairs. 
Overall, while SCH mapping is gradually com¬ 
ing to an end in most domestic species, the 
method is still occasionally used in pigs. For 
example, the genome-wide distribution of por¬ 
cine endogenous retroviruses was recently 
determined using several mapping approaches, 
including SCH analysis (Jung et al., 2010). 


Radiation hybrid (RH) panels 
and RH mapping 

Radiation hybrid (RH) mapping (see Chowdhary 
and Raudsepp, 2005; Faraut et al., 2009) is 
essentially an SCH technique with the differ¬ 
ence that, before the fusion of cell lines, the cells 
of the species of interest (donor) are exposed to 
high (lethal) doses of X-ray irradiation that cause 
the fragmentation of chromosomes. The dos¬ 
age may range from as low as 3000 rad (Gyapay 
et al., 1996; McCarthy et al., 1997) to as high 
as 50,000 rad (Lunetta et al., 1996). The 
higher the radiation dose, the higher is the reso¬ 
lution and mapping power of the RH panel, 
because more breaks and smaller chromosomal 
fragments are produced. Higher radiation doses 
also mean that more markers need to be geno- 
typed in order to construct RH maps. It has 
been calculated that, in a typical mammalian 
genome, an irradiation dose of 3000 rad should 
allow the mapping of about 3000 markers, 
while about 12,000 markers can be mapped to 
unique positions on a 12,000-rad RH panel 
(Chowdhary and Raudsepp, 2005; Faraut et al., 
2009). Traditionally, RH genotyping is carried 
out by PCR, and any type of markers, regard¬ 
less of their polymorphism status, can be 
assigned to RH maps. The data are analysed 
with dedicated software programs, of which 
Concorde (Agarwala et al., 2000) and 


carthagene (de Givry et al., 2005) have been 
recently used most. 

To date, four different whole genome 
(WG) RH panels have been constructed in the 
pig (Faraut et al., 2009): a commercial 3000- 
rad panel, T43RH (produced in Peter 
Goodfellow’s laboratory, commercially availa¬ 
ble through Research Genetics); a 5000-rad 
pig-mouse panel, SSRH (Hamasima et al., 
2003); a 7000-rad pig-hamster panel ImpRH 
(Yerle et al., 1998); and a 12,000-rad pig- 
hamster panel IMNpRH2 (Yerle et al., 2002). 
During the past decade, these panels have 
been extensively used for mapping individual 
chromosomes and chromosomal regions, and 
for developing WG RH maps. The T43RH 
panel has led to the development of RH and 
comparative maps for SSC2 (Rattink et al., 
2001), SSC6 (Cao et al., 2004) and porcine 
sex chromosomes (McCoard et al., 2002; 
Quilter et al., 2002). The panel was recently 
used to map six Toll-like receptor-related genes 
and generate RH and comparative maps for 
SSC8 and 13 (Jann et al., 2009). The 5000- 
rad SSRH panel allowed the construction of 
comparative maps for SSC8 and HSA4 (Jiang 
et al., 2002) and two WG RH maps. The first 
comprised 298 ESTs from porcine backfat 
tissue (Mikawa et al., 2004), and the other 
contained 4016 ESTs and microsatellite mark¬ 
ers (Hamasima et al., 2008). 

The most intensely exploited RH panel, 
however, has been the 7000-rad IMpRH. 
During the golden years of RH mapping at the 
end of the 1990s and in the early 2000s (Faraut 
et al., 2009), this panel allowed the mapping 
of large numbers of ESTs (Lahbib-Mansais et al., 
1999; Karnuah et al., 2001; Wang et al., 2001; 
Rink et al., 2002, 2006; Cirera et al., 2003) 
and orthologues of human genes (Lahbib- 
Mansais et al., 2000, 2003), resulting in high- 
resolution WG comparative maps (Rink et al., 
2002, 2006). The comparative maps revealed 
40 major breaks in synteny with the human 
genome and identified SSC2, 5, 6, 7, 12 and 
14 as ‘gene rich’ and SSC11 and X as ‘gene 
deserts’ (Rink et al., 2006). The IMpRH panel 
has contributed to mapping regions containing 
QTLs and disease genes. For example, detailed 
RH maps were developed over the RN locus 
on SSC15q25 (Robic et al., 1999) and the 
ETEC F4ac receptor locus on SSC13q41 (Ren 
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et a/., 2009). High-resolution RH and com¬ 
parative maps were also constructed for indi¬ 
vidual chromosomes, namely SSC2q and SSC16 
(Shimogiri et a/., 2006), SSC1 and SSC 7 
(Demeure et a/., 2005), SSC 17 (Lahbib- 
Mansais etai, 2005), or chromosomal regions, 
namely SSClq (Yasue et a/., 2008). The panel 
has been instrumental in integrating the link¬ 
age and physical maps of individual porcine 
chromosomes, namely SSC 13 (Van Poucke 
et a/., 2001), SSC4q (Stratil et a/., 2001), 
SSC15q21 (Robic et a/., 2001) and SSClq 
(Sarker et a/., 2001), or the whole genome 
(Hawken et a/., 1999). 

The highest resolution porcine RH panel 
is the 12,000-rad IMNpRH2 (Yerle et a/., 
2002), which was generated to complement 
the 7000-rad IMpRH (Yerle et ai, 1998) for 
fine mapping QTLs and major genes, and to 
support the WG BAC fingerprint map and the 
WG sequence assembly. Comparison of the 
IMpRH and IMNpRH2 framework maps for 
the RN region on SSC 15 showed that the 
resolution of IMNpRH2 exceeds that of the 
IMpRH almost three times (Yerle etai, 2002). 
Since then, the IMNpRH2 has been used 
alone or in combination with the IMpRH to 
generate high-resolution maps for the SSC7 
QTL region (Demars et a/., 2006), refine 
conserved synteny between SSC 12 and 
HSA17 (Liu et a/., 2005) and orient BAC/ 
PAC contigs over SSC6ql.2 (Martins-Wess 
etal., 2003a,b). Development of STS markers 
from BAC end sequences (BES) and their 
genotyping on IMpRH and/or IMNpRH2 
have essentially increased the number of 
markers and the resolution of maps (Kiuchi et 
a/., 2002; Meyers et a/., 2005). This has led 
to the fine integration of RH maps with mei- 
otic, BAC fingerprint and comparative maps 
for SSC2p and SSC9p (Liu et a/., 2008), 
SSC10 (Ma et a/., 2009) and SSC 13 
(Rogatcheva et a/., 2008). Notably, the RH 
map of SSC 13 is solely based on BES mark¬ 
ers and has a comparative anchor at every 
1.22 Mb relative to the human genome 
sequence. Most importantly, RH mapping of 
over 2000 BES markers (Meyers et a/., 2005) 
has assisted the assembly of the WG BAC fin¬ 
gerprint map (Humphray et a/., 2007) and 
the selection of a BAC minimal tiling path for 
the sequencing of the pig genome. 


Public release of the first draft of the swine 
genome sequence (University of Illinois at 
Urbana-Champaign, 2009) will not reduce the 
need for accurate and high-resolution RH and 
integrated maps because such maps are indis¬ 
pensable for sequence assembly precision (Lewin 
et a/., 2009). However, increasing amounts of 
sequence data and the construction of new 
genome analysis tools, such as SNP arrays, may 
introduce dramatic changes in mapping strate¬ 
gies (Faraut et a I., 2009). The availability of 
sequences simplifies and expedites production of 
mapping markers. This creates a need for higher 
resolution RH panels, with an increased dose of 
irradiation and more clones. Alternatively, the 
RH panels can be genotyped on the pig 6OK 
SNP beadchip (Ramos et a!., 2009) or the 
Illumina 50K SNP chip (Porcine SNP Chip 
Consortium), which allows simultaneous geno¬ 
typing of 50,000-60,000 markers on each RH 
clone. As a consequence, very large numbers of 
markers can be genotyped on an RH panel in a 
short time, and the construction of maps will 
need improved statistical analysis methods. 
Overall, despite the rapid development of tech¬ 
nology and changes in the mapping paradigm, it 
is not likely that the need for fine and accurate RH 
maps in swine will disappear too soon. 


Large-insert clone libraries 
and clone-based mapping 

An essential resource for the construction of 
fine-scale physical genome maps is large- 
fragment genomic libraries cloned into yeast 
artificial chromosome (YAC), PI bacterio¬ 
phage, PI-derived artificial chromosome 
(PAC) or bacterial artificial chromosome (BAC) 
vectors. The first such libraries in pigs were 
constructed using YAC vectors (Leeb et a/., 
1995; Alexander et ai, 1997; Rogel-Gaillard 
et ai, 1997a), followed by a PAC library 
(Al-Bayati et ai, 1999). It was soon realized 
that, although YACs can carry inserts larger 
than 1000 kb, they also show a high degree of 
chimerism and are thus not the best for map¬ 
ping or sequencing. PACs, in contrast, can 
take in about 100-300kb inserts and are stable, 
but are more difficult to handle than BACs. 
Thereafter, BACs have been the vectors of 
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choice to construct genomic libraries for the 
pig. Currently, there are seven WG BAC librar¬ 
ies (Table 7.5), most of which have been exten¬ 
sively used for positional cloning, QTL-, RH-, 
FISH- and clone-based mapping. Four librar¬ 
ies, namely CHORI-242, RPCI-44, INRA and 
PigEBAC, are incorporated into the integrated 
physical map of the pig genome (Humphray et 
a/., 2007), which has been the backbone for 
the pig genome sequencing project (Schook et 
a/., 2005). The most recent library, CHORI- 
247, has the highest genome coverage (15X) 
and was constructed to get a good representa¬ 
tion of the pig Y chromosome sequences (B.P. 
Chowdhary, unpublished data). 

BAC-based chromosome and regional maps 

Since the first porcine BAC libraries became 
available in 1999-2000, BAC clones have 
found intensive use in the development of con- 
tig maps over shorter regions in the pig genome, 
of which SSC6, 7, 13, and 15 have been of 
particular interest. The first BAC contigs were 
constructed over the ~2.5Mb region around 
the ‘acid meat’ RN locus on SSC15q25 (Jeon 
et a/., 2001; Robic et a/., 2001). The contig 
allowed the identification and fine mapping of 
tens of genes in the region, and showed that 
the homologous segment on HSA2q35 shares 
a high degree of linkage conservation with the 
pig. Several BAC maps have been constructed 
in SSC7 for the porcine MHC (major histocom¬ 
patibility complex) (Barbosa et a/., 2004; Ando 
et a/., 2005; Demars et a/., 2006) and for 
regions containing fat-related QTLs (Sato et a/., 
2006; Tanaka et a/., 2006). Another chromo¬ 
some of interest is SSC6, because it harbours 
the RYR1 locus for stress susceptibility and a 
putative QTL for muscle growth (Martins-Wess 
et a/., 2002). Two BAC/PAC contigs, one 
1.2Mb (Martins-Wess et a/., 2002) and another 
5.5Mb (Martins-Wess et a/., 2003a,b), were 
developed to span the RYR1 locus at 
SSC6q21. The two contigs have been very use¬ 
ful in characterizing this gene-rich region and 
comparing it with HSA19ql3.1 (Martins-Wess 
et a/., 2003b). Sequencing BACs from a contig 
on SSC6q28-q31 that spans several fat-related 
QTLs provided important information for 
comparative mapping and positional candidate 
genes (Lee et a/., 2006). Similarly, BAC contigs 


have helped positional candidate gene cloning 
and refined the map in the region associated 
with neonatal diarrhoea on SSC13q31-q32 
(Van Poucke et a/., 2005; Joller et a/., 2009). 
Besides these chromosomes, a contig compris¬ 
ing 51 BAC clones was developed over SSC2p- 
ql3, a region of maternally imprinted QTL for 
backfat thickness (Rattink et a/., 2001). Fine 
regional BAC maps have also been instrumen¬ 
tal in the discovery of a large duplication in the 
KIT gene associated with dominant white coat 
colour in pigs (Giuffra et a/., 2002), and for the 
discovery of a mutation in ASIP locus responsi¬ 
ble for the black-and-tan pigmentation in 
Mangalitsa pigs (Drogemuller et a/., 2006). 

Whole genome BAC fingerprinted contig 
(FPC) and comparative maps 

The construction of porcine BAC libraries and 
clone-based mapping have played a vital role in 
the international Porcine Genome Sequencing 
Project (http://www. sanger. ac. uk/Projects/S_ 
scrofa/). Recent sequencing projects for dogs 
(Lindblad-Toh et a/., 2005), cattle (Elsik et a/., 
2009) and horses (Wade et a/., 2009) mainly 
used the WG shotgun method, in which BACs, 
BAC contig maps and BES helped only to vali¬ 
date sequence assembly and to anchor sequence 
scaffolds to the chromosomes. In contrast, pig 
genome sequencing is exclusively clone-based 
and relies on the BAC fingerprinted contig 
(FPC) map (Humphray et a/., 2007), the BES- 
based WG RH map (Meyers et a/., 2005) and 
the comparative maps produced by aligning pig 
BES with the sequence map of the human 
genome (Meyers et a/., 2005; Humphray 
et a/., 2007; Rogatcheva et a/., 2008). 

The BAC FPC map was constructed by 
digesting BAC DNA with Hi n dill followed by 
high-throughput fingerprinting of over 
260,000 BAC clones from four different BAC 
libraries (Table 7.5). Altogether, 267,884 fin¬ 
gerprints were assembled into 524 contigs, 
and a highly continuous BAC contig map with 
15.3x coverage across the 2.7 Gb of porcine 
genome was constructed. The map spans 98% 
of the euchromatin of the 18 pig autosomes 
and the X and the Y chromosomes (Humphray 
et a/., 2007). Incorporation of the Y chromo¬ 
some is an outstanding difference from species 
such as cattle (Elsik et a/., 2009) and horses 
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Table 7.5. List of whole genome BAC libraries constructed for the pig. 


Library 

Genome 

coverage 

Vector 

Average 
insert 
size (kb) 

Breed 

Sex 

Reference(s) 

INRA, Pigl a 

5x 

pBeloBAC11 

135 

Large White (LW) 

Male 

Rogel-Gaillard 
etal., 1999 

Japan 

4.4x 

pBAC-lac 

133 

LW x Land race x 

Du roc 

Male 

Suzuki etal., 

2000 

pEBAC 3 

5x 

pBeloBAC11 

150 

FI LW x Meishan 

Male 

Anderson 
etal., 2000 

RPCI-44 a 

11.3x 

pTARBAC2 

165 

Yorkshire x Landrace 
x Meishan 

Males (4) 

Fahrenkrug 
etal., 2001 

Korea 

7x 

pBACe3.6 

125 

Korean native 

Male 

Jeon etal., 2003 

CHORI-242 3 

11.4x 

pTARBACI .3 

173 

Du roc 

Female 

http://bacpac. 

chori.org/ 

home.htm 

CHORI-247 

15x 

pTARBAC2.1 

ni 

Duroc ID No. 243-2 

Male 

http://bacpac. 

chori.org/ 

home.htm; 

B.P Chowdhary, 

unpublished 

data 


a Used for the construction of BAC (bacterial artificial chromosome) fingerprinted contig (contiguous) (FPC) map and for 
pig genome sequencing, 
ni, not indicated. 


(Wade et a/., 2009), where female individuals 
were used for the sequencing, thus excluding 
the Y chromosome. The final map integrates 
the BAC FPCs with the 2068 BES-marker 
whole-genome RHs (Meyers et a/., 2005) and 
the BES-based pig-human comparative sequence 
maps (Humphray et a/., 2007). This high-utility 
map provided a template for clone tile path 
selection for pig genome sequencing, and can be 
used for ‘electronic cloning’ of the regions of 
importance in the porcine genome. Furthermore, 
virtual integration of all available map informa¬ 
tion has resulted in the construction of Virtual 
Comparative (VC) Maps (Bio::Neos, 2010) for 
SSC1, 4, 5, 7, 11, 13 and 14, which give the 
most detailed physical map organization availa¬ 
ble so far and integrate sequence data with the 
data of all other physical maps. 

Cytogenetic mapping using FISH 

Cytogenetic maps of the pig genome 

The pig was the first farm animal to which the 
ISH technique was applied for gene mapping. 
Twenty-six years ago, the porcine MHC locus 


was assigned to the pericentromeric region of 
SSC7 (Geffrotin et a/., 1984; Rabin et a/., 
1985; Echard et a/., 1986). Thereafter, a few 
groups initiated organized ISH experiments to 
expand the porcine physical gene map. By 
1993, 68 genes had already been localized using 
the radioactive method (Chowdhary, 1998). 
One of the major advantages of radioactive ISH 
was the ability to use small heterologous (mainly 
human) probes because, at the onset of organ¬ 
ized genome analysis in pigs, very few cloned 
porcine sequences were available. 

The early 1990s witnessed the introduc¬ 
tion of non-isotopic techniques and, since then, 
cytogenetic mapping in pigs has been con¬ 
ducted using FISH. One of the first principal 
uses of FISH in domestic animals was to align 
all individual syntenic and linkage groups to 
specific chromosomes. In pigs, this ‘mission’ 
was accomplished in about 5 years (Ellegren et 
a/., 1993, 1994; Robic et a/., 1996, 1997; 
Rohrer et a/., 1996). An important ‘booster’ 
for FISH mapping was that finally there were 
available porcine specific large-insert clones - 
first cosmids (Ellegren et a/., 1993; Alexander 
et a/., 1996) and, later, YACs and BACs (see 
previous section). In a few studies, with the aim 
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of constructing comparative maps, attempts 
were made to FISH map small cDNA sequences 
(Chaudhary et a/., 1997; Thomsen et a/., 
1998). Owing to the relatively weak signals 
produced, this approach did not find much 
use later. 

The most intensive FISH mapping period 
in porcine genomics was at the end of the 
1990s when, in just a few years, the number 
of cytogenetically assigned loci tripled - from 
154 in 1995 (Yerle et a/., 1995) to 436 in 
1997 (Yerle et a/., 1997). Among the 436 
loci, 160 represented specific genes and the 
remaining 276 were Type II (polymorphic) 
loci. Since then, the use of FISH for porcine 
gene mapping has gradually decreased. In the 
past 10 years, only about 500 new loci have 
been added to the cytogenetic map. Among 
these, the majority are single or a few loci 
assignments to map disease or QTL-related 
genes, for example, mapping candidate genes 
for human obesity (Nowacka-Woszuk et a/., 
2008) or porcine pregnancy-associated genes 
(Majewska et a/., 2010). Only a few studies 
have carried out GW mapping of larger num¬ 
bers (over 50) of loci (Lopez-Corrales et a/., 
1999; Finton, P. et a/., 2000; Anistoroaei 
et a/., 2004). Several FISH projects have 
focused on specific chromosomes, namely 
SSC13 (Sun, H.F. et a/., 1999; Van Poucke 
et ai, 1999, 2003; Stratil et a/., 2001), SSC11 
(Sun, H.S. et a/., 1999) or chromosomal 
regions, such as the RN region on SSC15 
(Tornsten et ai, 1998). The FISH mapping 
of 19 genes to the pig X and Y chromosomes 
(Quilter et a/., 2002) has been, so far, the only 
detailed study of the comparative organiza¬ 
tion of porcine sex chromosomes. Compared 
with the horse and dog, in which cytogenetic 
anchors have been an instrumental part in 
the construction of WG RH and integrated 
maps (Breen et a/., 2004; Raudsepp et a/., 
2008), the approach has found much less use 
in pigs (Cirera et a/., 2003; Lahbib-Mansais 
et a/., 2003). This is probably because RH 
mapping of thousands of BES markers 
(Meyers et a/., 2005) and their alignment 
with BAC FPCs (Humphray et a/., 2007) 
and human genome sequences (Rogatcheva 
et a/., 2008) have already provided sufficient 
numbers of anchors for most of the porcine 
chromosomes. 


High-resolution cytogenetic maps 

fibre-fish. Fibre-FISH is an essential improve¬ 
ment in the resolution of chromosome mapping 
in which FISH is conducted on preparations of 
extended chromatin fibres. DNA probes are 
typically mapped to metaphase chromosomes at 
a resolution of about 5 Mb (Raudsepp and 
Chowdhary, 2008). In contrast, fibre-FISH 
enables the distinction of probes separated by 
1-2 kb, and can be used for positional cloning, 
determining the transcriptional orientation of 
clones, or the detection of minor chromosomal 
rearrangements (Laan et ai, 1996). In domestic 
animals, the technique was first used in pigs to 
study the organization of the MHC (Sjoberg 
et a I. , 1997) and to order subclones from the 
porcine erythropoietin gene (Liu et a\. , 1998). 
The studies delineated the order of individual 
clones, and also estimated the physical distances 
(kb) between them. 

dna-combing. DNA-combing is an approach 
that technically resembles fibre-FISH (Conti 
et a/., 2001; Labit et ai , 2008), but differs in 
the source and preparation of target DNA. 
The target may include any cloned DNA that is 
applied in solution, on a silanated glass slide. 
Gradual evaporation of the solution under a 
glass coverslip stretches, straightens and fixes 
the DNA molecules, thus making them available 
as targets for hybridization. The technique is 
best suited to constructing a contig map of a 
set of subclones originating from a BAC or 
YAC clone. The only study hitherto carried out 
in pigs, or in any other farm animal, relates to 
ordering a set of plasmid subclones derived 
from a lambda clone containing the porcine 
EPO gene (Liu et al. , 1998). Distinct 
hybridization signals on combed DNA fibres 
enabled the ordering of the three subclones, 
and the results were in complete agreement 
with those obtained by fibre-FISH in the same 
study. 

Physical mapping using flow-sorted and 
microdissected chromosomes 

Relatively few physical maps have been con¬ 
structed using porcine flow-sorted and microdis¬ 
sected chromosomes. In one study, chromosomal 
segments for SSClp, Iq26-q2.13, 2qll-ql4, 
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4ql2-q25, 13ql2-q31, 13q32-q43 and 

16q21-q23 and for whole chromosomes 
SSC13 and SSC15 were microdissected 
(Chaudhary et a/., 1998a). After verifying DNA 
origin by FISH, the microdissected material was 
used to construct regional libraries and to isolate 
microsatellite markers. Microdissected or flow- 
sorted chromosome-specific libraries have also 
been constructed for SSClq (Sarker et a/., 
2001), SSC6 (Ambady et a/., 1997; Grimm 
et a/., 1997; Zhao et a/., 1999), SSC8 (Wang 
et a/., 2000a,b), SSC11 (Riquet et a/., 1995), 
SSC13 (Davies et a/., 1994; Wang et a/., 2001) 
and SSC18 (Ellegren and Basu, 1995). The 
approach has produced microsatellite markers 
to integrate linkage or RH maps with the pre¬ 
cise cytogenetic location. 


Comparative chromosome 
maps - Zoo-FISH 

An important breakthrough in the construction 
of physical chromosome maps was the inven¬ 
tion of cross-species FISH, also referred to as 
Zoo-FISH (Scherthan et a/., 1994). The proce¬ 
dure is basically the same as for regular FISH 
except for some modifications, of which the 
most critical are the use of heterologous probes 
with over two times higher DNA concentra¬ 
tion and prolonging the hybridization time 
(Chowdhary and Raudsepp, 2001). In a few 


studies, the probes originated from large insert 
libraries. For example, human FAC clones 
containing human LCAT gene cluster were 
used to determine the homologous region on 
SSC6pl3 (Frengen et a/., 1997). The majority 
of the Zoo-FISH studies, however, have used 
composite chromosome-specific painting 
probes to determine chromosomal homologies 
between species. The method is, therefore, 
also called comparative chromosome painting 
(Scherthan et a/., 1994). 

The primary goal of Zoo-FISH was, and 
still is, to transfer gene map information from 
‘map rich’ species to ‘map poor’ species, and to 
study karyotype and chromosome evolution 
(Chowdhary et ai, 1998; Chowdhary and 
Raudsepp, 2001; Ferguson-Smith and Trifonov, 
2007). With respect to the porcine genome, 
Zoo-FISH experiments have been conducted 
between closely related species, namely the pig 
and other suids/Suiformes or cetartiodactyls, 
and between distantly related species, namely 
the pig and primates or carnivores (Table 7.6). 

Human-pig Zoo-FISH 

The pig genome was the first among farm 
animals that was examined with the whole set 
of human chromosome paints (Rettenberger 
et ai, 1995; Fronicke et ai, 1996). Both stud¬ 
ies essentially arrived at the same conclusion 
and proposed that there are 47 segments of 
conserved synteny between the human and the 


Table 7.6. Zoo-FISH (fluorescence in situ hybridization) between pig and other mammalian species. 
Unidirectional painting is indicated by and bidirectional by 

Species Reference(s) 


Between suids 
Pig babirusa 
Between Suiformes 

Pig -> collared and white-lipped peccary 
Pig Arizona collared peccary 
Between cetartiodactyls 
Pig cattle 
Sheep -» pig 
Dromedary camel pig 
Between distantly related mammals 
Human pig 

Human pig 
Dog -> pig 


Bosma etal., 1996 

Bosma et at., 2004 
Adega et at., 2006 

Schmitz etal., 1996 
Fronicke and Wienberg, 2001 
Balmus etal., 2007 

Rettenberger etal., 1995; Fronicke etal., 1996; 

Chaudhary etal., 1998b 
Goureau etal., 1996 
Biltueva et a!., 2004 
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pig (Plate 2). These initial results were further 
refined by human-pig whole genome recipro¬ 
cal chromosome painting (Goureau et a/., 
1996; Milan et a/., 1996), and by Zoo-FISH 
using arm-specific paints for HSA2, 5, 6 and 
16 (Chaudhary et a/., 1998b). The main 
significance of these early studies was their 
contribution to the development of the porcine 
WG gene map. 

Later on, FISH mapping of single genes 
(Pinton, P. et a/., 2000) and the construction of 
high-resolution comparative RH and BAC con- 
tig maps (Rink et a/., 2002; Meyers et a/., 
2005; Humphray et a/., 2007; Rogatcheva 
et a/., 2008) essentially confirmed the findings 
of Zoo-FISH, but also refined and improved it. 
Rink and colleagues (Rink et a/., 2002) were 
able to add three new conserved synteny 
groups to the human-pig comparative map, 
and provisionally assigned a few others. They 
also identified at least 60 break-points and 90 
micro-rearrangements between the human and 
the porcine genomes. Today, 51 conserved 
synteny groups and 173 conserved segments 
are known between the two genomes (Meyers 
et a/., 2005), although the most comprehen¬ 
sive comparison will soon be available at the 
genome sequence level. 

Zoo-FISH between pig and other 
domestic species 

Zoo-FISH has been used to delineate segmen¬ 
tal homologies and integrate gene maps 
between pigs and cattle (Schmitz et a/., 1996), 
pigs, sheep and cattle (Fronicke and Wienberg, 
2001), pigs and dogs (Biltueva et a/., 2004) 
and, most recently, between pigs, humans, 
cattle and camels (Balmus et a/., 2007) 
(Table 7.6). Notably, a high number of evolu¬ 
tionary rearrangements, involving fissions, 
fusions and inversions, have been found 
between the karyotypes of pig and species of 
Bovidae (Fronicke and Wienberg, 2001). For 
example, pigs and sheep share 62 conserved 
synteny groups, which is more than the number 
between pigs and humans (Meyers et a/., 2005), 
and unusually high for a species belonging to 
the same mammalian order. Differences 
between ruminant and suid genomes have been 
attributed to extensive karyotype rearrange¬ 
ments in the Suidae family, characterized by 


multiple inversions, fissions and fusions 
(Fronicke and Wienberg, 2001). In contrast, 
only 53 autosomal conserved segments were 
found between camels and pigs (Balmus et a/., 
2007). It is, however, of no surprise that almost 
100 homology blocks were found between the 
pig and dog karyotypes (Biltueva et a/., 2004). 
The two species are evolutionarily distant and 
both have undergone relatively rapid karyotype 
evolution. 

Zoo-FISH has also helped in the under¬ 
standing of chromosome evolution within suid 
and Suiformes species. Fig chromosome paint¬ 
ing probes have helped to refine chromosomal 
homologies with babirusas (Bosma et a/., 
1996) and two different peccary species 
(Bosma et a/., 2004; Adega et a/., 2006). 


Concluding Remarks 

During the ‘golden’ years of pig genome map¬ 
ping in the 1990s, it seemed that the primary 
role of porcine cytogenetics is to be a launch 
pad for physical mapping, while the need for 
classical chromosome analysis would gradually 
disappear. Therefore, it is pleasant to see that 
pig cytogenetics has successfully survived the 
‘mapping era’ and is entering the ‘sequencing 
era’, armed with molecular tools borrowed 
from physical gene mapping. Furthermore, 
there are clear signs that porcine cytogenetics 
is ready to take the next step - combining the 
genome sequence data with array technology 
and bringing chromosome analysis to sub- 
microscopic levels. Thus, application of novel 
molecular approaches, together with pressing 
commercial needs, is expected to continue 
porcine chromosome analysis as an important 
part of agricultural programmes in many 
countries. 

Gene mapping, in contrast, is in a transi¬ 
tional phase. With the availability of the WG 
BAC FPC map (Humphray et a/., 2007), VC 
maps (Bio::Neos, 2010) and the draft genome 
sequence, the need for other maps will reduce. 
However, as experienced in humans and 
other sequenced species, the need for high- 
resolution meiotic and physical maps will not 
disappear completely. This idea was elegantly 
formulated in the phrase ‘every genome 
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sequence needs a good map’ (Lewin et a/., 
2009), meaning that high-resolution gene and 
comparative maps remain indispensable to 
validate sequence assembly. This need is even 
more pronounced for the assembly of millions 
of short reads produced by next and next-next 
generation sequencing technologies. Overall, 
integrated meiotic and physical maps con¬ 
tinue to offer a well-tested resource for the 
analysis of genome architecture, function and 


evolutionary history in many species, includ¬ 
ing the pig. 
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Introduction 

The systematic characterization and mapping 
of the porcine genome started in the late 
1980s and early 1990s following the emer¬ 
gence of genome research stimulated by the 
Human Genome Organisation (HUGO) with 
its goal to map and sequence the complete 
human genome. The EU-funded Pig Gene 
Mapping Project (PiGMaP) constituted the first 
internationally coordinated effort to map the 
porcine genome, and, with this, the pig was 
the first livestock species whose scientific com¬ 
munity organized to completely map the pig 
genome (Haley et a/., 1990). The initial efforts 
of the PiGMaP project focused on the develop¬ 
ment of genetic markers in the pig (Davies 
eta/., 1994; Coppieters et a/., 1995; Groenen 
et a/., 1995), and the establishment of a 
genetic linkage map (Archibald et a/., 1995) 
and a cytogenetic map (Echard et a I., 1992; 
Yerle et a/., 1995). In September 2003, the 
sequencing of the complete genome of the pig 

©CAB International 2011. The Genetics of the Pig, 
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formally started with the establishment of the 
Swine Genome Sequencing Consortium 
(SGSC) by representatives from academia, 
government and industry (Schook et a/., 
2005). The necessary foundation towards this 
goal had been set up in the previous decade 
through the development of detailed linkage 
maps (Ellegren et a/., 1994; Rohrer et a/., 
1994, 1996; Archibald et a/., 1995) and phys¬ 
ical maps based on a variety of methods such 
as somatic cell hybrids (Rettenberger et a/., 
1994, 1996; Yerle et a/., 1996), in situ 
hybridization (Fronicke et a/., 1996; Goureau 
et a/., 1996; Chowdhary et a/., 1998) and 
whole-genome radiation hybrid (RH) mapping 
(Yerle et a/., 1998, 2002; Hawken et a/., 
1999). In this chapter, the focus is on com¬ 
parative genome maps, genomic resources, 
genome variation and sequencing of the por¬ 
cine genome. Other maps will be briefly dis¬ 
cussed when relevant to these subjects. Because 
systematic genome mapping in the pig started 
with the generation of comprehensive linkage 
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maps, and because the bacterial artificial chro¬ 
mosome (BAC) map of the porcine genome has 
provided the template for the generation and 
assembly of the high-quality anchored sequence 
of the porcine genome (Schook et al, 2005; 
Humphray et al., 2007), we will also discuss 
these maps in more detail in this chapter. 

Porcine Expressed Sequence Tag 
(EST) Sequencing and Clustering 

Partial sequencing of expressed sequences is an 
efficient and economical method to rapidly 
acquire information about the gene content of 
an organism. Pioneered in the early 1990s by 
Craig Venter and co-workers (Adams et al., 
1991), it has become an important genome 
resource in functional genomics (expression 
studies), as well as an invaluable tool for the 
annotation of the genome sequence and the 
construction of gene models. The generation of 
pig ESTs was initiated by several groups (Tuggle 
and Schmitz, 1994; Wintero et al., 1996; 
Tosser-Klopp et al., 1997); in particular, the 
Sino-Danish Pig Genome Project has boosted 
the number of porcine ESTs by sequencing 
over one million porcine ESTs derived from 97 
different cDNA (complementary DNA) libraries 
(Gorodkin et al., 2007). The current (27 August 
2010) number of pig ESTs in the Expressed 
Sequence Tags database (dbEST) at NCBI (the 
US National Center for Biotechnology 
Information) is 1,621,000 (http://www.ncbi. 
nlm.nih.gov/dbEST/index.html) with the most 
recent UniGene Build (No. 39, 7 August 2010; 
http: //www. ncbi. nlm. nih. gov/unigene/estpro 
fileviewer) comprising 53,600 different Unigene 
clusters. Similarly, within the Gene Index 
Project (Lee et al., 2005), the Computational 
Biology and Functional Genomics Laboratory 
at the Dana Farber Cancer Institute has used 
the porcine EST and mRNA sequences to pro¬ 
duce a gene index of transcripts found in the 
pig. Within this project, more stringent cluster¬ 
ing parameters were used, resulting in higher 
numbers of clusters than obtained with Unigene. 
Based on 1,387,573 EST and mRNA 
sequences, the clustering resulted in a total of 
110,744 different clusters and 128,917 singleton 
ESTs (Release 14, March 2010). 


Expression Profiling of the Porcine 

Transcriptome 

Although there are numerous methods for the 
study of the expression of specific genes, for 
studying gene expression on a genome-wide 
scale (i.e. for studying all transcripts within a 
sample - e.g. cell, tissue, etc. - simultan¬ 
eously), in essence only two fundamentally 
different methodologies are utilized. The most 
widely used technique is based on hybridiza¬ 
tion of the transcripts against an array of 
probes representing all the genes (Schena 
et al., 1995). The alternative method is to 
sequence a representative part of all the 
transcripts present within the sample being 
analysed and count the number of times a 
transcript is observed (Velculescu et al., 1995; 
Mardis, 2008). The first large-scale porcine 
expression profiling experiments were pio¬ 
neered using human microarrays (Medhora 
et al., 2002; Moody et al., 2002), rapidly 
followed by the design and use of porcine- 
specific microarrays (Bai et al., 2003; Nobis 
et al., 2003). Since that time, the number of 
porcine-specific microarrays, including com¬ 
mercially manufactured arrays, designed and 
used has increased dramatically (see Table 8.1 
for microarray data deposited at the Gene 
Expression Omnibus of the NCBI). The 
number of genes and transcripts investigated 
in these different studies varied dramatically, 
from a few hundred to thousand. However, 
even the studies in which the expression of a 
few thousand porcine genes was assayed suf¬ 
fered from being incomplete, as there are esti¬ 
mated to be 20,000-25,000 protein-coding 
genes in the pig genome. Moreover, there are 
multiple differently spliced transcripts for many 
of these genes. For comparison, the human 
genome contains about 23,500 protein-cod¬ 
ing genes and encodes more than 140,000 
different gene transcripts. Thus, it is only 
recently, after the completion of a draft 
sequence of the porcine genome, that it has 
become possible to design more comprehen¬ 
sive microarrays representing the majority 
of the porcine genes. For a recent more 
extensive review of the microarray studies per¬ 
formed in the pig, readers are referred to the 
paper by Tuggle et al. (2007). 
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Table 8.1. Microarray experiments deposited in Gene Expression Omnibus (GEO). The table provides 


information on the collection of 36 different microarray platforms used for the pig and deposited in the 
GEO, which is available at http://www.ncbi.nlm.nih.g 0 v/ge 0 /.The GEO is a public functional genomics 
data repository for microarray data. As of 16 March 2010, the GEO contains submissions for 148 porcine 
microarray expression data sets based on 38 different GEO platforms (GPLs). The table only shows the 

36 platforms related to gene expression microarrays. 

GEO 

accession 

no. 

Approximate 
no. of genes 
(max.) 

Probe type 

No. of samples 

Array description 

Contact 

GPL336 

870 

cDNA 

2 

Porcine Brain Library 
array 

Steven Paul 
Suchyta 

GPL518 

1,272 

cDNA 

6 

UIUC Porcine muscle 
plus 

Yewon Cheon 

GPL1209 

GPL1270 

1,021 

cDNA 

16 

10 

Porcine 1000 embryo 
gene array 
SAGE:10:Nlalll:Sus 
scrofa 

Christopher K. 

Tuggle 

GEO 

GPL1624 

2,423 

cDNA 

15 

PorkChip 2,600 cDNA 
array 

Kendra A. Hyland 

GPL1881 

12,302 

oligo 

200 

Qiagen-NRSP-8 
porcine oligo array 

Christopher K. 
Tuggle 

GPL2731 

3,456 

cDNA 

118 

Spotting_ 

muscle_21OCT03 

Laurence Liaubet 

GPL3461 

10,665 

oligo 

104 

Duke Operon Porcine 
10.5K Oligo Array 

Heather Anne 
Himburg 

GPL3533 

20,201 

oligo 

399 

[Porcine] Affymetrix 
Porcine Genome 
Array 

Affymetrix, Inc. 

GPL3585 

26,877 

cDNA 

10 

DIAS_PIG_55K2_v1 

Jakob Hedegaard 

GPL3594 

5,375 

cDNA 

6 

D1 AS_P 1 G_27K2_v2 

Jakob Hedegaard 

GPL3608 

26,877 

cDNA 

138 

DIAS_PIG_55K3_v1 

Jakob Hedegaard 

GPL3707 

10,665 

oligo 

30 

Pig_Array_Ready 

Oligo set vl .0 

Bhupinder Juneja 

GPL3729 

9,216 

cDNA 

84 

AGENAE_ 

PigGeneric2_9216 

Karine Hugot 

GPL3764 

192 

oligo 

36 

Porcine oligo micro 
array version 3 

Shila Mortensen 

GPL3970 

4,608 

cDNA 

24 

scag_scai Sus scrofa 
4.6K triplicate array 

Gwenola 

Tosser-Klopp 

GPL3971 

1,152 

cDNA 

28 

scag_scai Sus scrofa 

1.2K mono array 

Gwenola 

Tosser-Klopp 

GPL3978 

GPL4061 

GPL4262 

2,854 

cDNA 

46 

10 

2 

INRA Sus scrofa 4K 
SAGE:17:Nlalll:Sus 
scrofa 

SAGE:10:Sau3A:Sus 
scrofa 

Aqnes Bonnet 

GEO 

GEO 

GPL4872 

9,729 

oligo 

0 

SBTM Microarray 
Laboratory Operon 
Pig vl.O 

Vincent 

VanBuren 

GPL4930 

9,556 

oligo 

32 

Intestinal epithelial 
crypts and villi in 
conventional relative 
to germ-free pig 

H. Rex Gaskins 

GPL5171 

656 

oligo 

38 

Pork Quality Operon 

Mingzhou Li 


70-mer oligo array 


Continued 
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Table 8.1. Continued. 


GEO 

accession 

no. 

Approximate 
no. of genes 

(max.) Probe type 

No. of samples 

Array description 

Contact 

GPL5340 

9,944 

cDNA 

6 

Porcine testis cDNA 
microarray 060717 

Wen-chuan Lee 

GPL5374 

11,500 

cDNA 

8 

NLI_SSC_11.5K_ 
cDNA_V1 

Jin Zhang 

GPL5468 

9,290 

oligo 

0 

DIPROVAL - 
OPERON Sus 
scrofa AROS VI .0 

Roberta Davoli 

GPL5622 

1,699 

cDNA 

48 

SLA_PrV porcine 
DNA/cDNA 
microarray 

Laurence Flori 

GPL5948 

Unknown cDNA 
(non- 

sequenced 

cDNAs) 

16 

ASG Porcine jejunum 
spleen cDNA array 

Gabriele Gross 

GPL5972 

5,375 

cDNA 

0 

DJF Pig oligo 27K1 
verl 

Jakob Hedegaard 

GPL6173 

26,877 

cDNA 

134 

DJF Pig 55K vl 

Jakob Fledegaard 

GPL6472 

23,256 

oligo 

66 

Affymetrix GeneChip 
Porcine Genome 
Array probe-level 

Nicholas Eldon 
Hardison 

GPL6553 

Genomic oligo 

CNV 

24 

Nimblegen 385K pig 
array CGFI 

Jakob Hedegaard 

GPL6849 

>200 

oligo 


Porcine oligonucleo¬ 
tide microarray 
version 4 (POM4) 

Kerstin 

Skovgaard 

GPL7151 

17,100 

oligo 

0 

SLA/Immune 
Response/NRSP8 
Pig 70 mers 
Oligonucleotides 
3.8K+ 13.3K vl 

Karine Hugot 

GPL7435 

19,486 

oligo 

16 

Swine Protein- 

Annotated 

Oligonucleotide 

Microarray 

Catherine W. 

Ernst 

GPL7576 

>200 

oligo 

0 

Porcine oligonucleo¬ 
tide microarray 
version 4 (POM4) 
(Condensed 
version) 

Jayda Siggers 


cDNA, complementary DNA; oligo, oligonuleotide. 


The alternative method of global gene 
expression analysis, direct sequencing and 
numeration of the transcripts, circumvents the 
bias of only measuring those genes that have 
previously been identified and sequenced. 
The first methodology that used sequencing 
and counting of short tags derived from mRNA 
to analyse gene expression was called serial 


analysis of gene expression or SAGE (Velcu- 
lescu et a/., 1995). Zuelke and co-workers 
(Zuelke et a/., 2003; Blomberg and Zuelke, 
2004; Miles et a/., 2008) were the first to 
apply this technology to pigs for their study 
of gene expression during porcine embryonic 
development. In these studies, between 

80,000 and 100,000 SAGE tags were 
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sequenced, which represented 20,000-23,000 
putative porcine transcripts. However, because 
at that time the complete sequence of the por¬ 
cine genome was not yet available, the number 
of different genes represented by these tags was 
not known. More recently, SAGE has also been 
used for the identification of porcine long non¬ 
coding RNA (ncRNA) (Ren et a/., 2009). 
Although the SAGE technology circumvents the 
problem of the absence of sequence information 
for many porcine genes, and in principle allows 
an unbiased and sensitive analysis of gene 
expression, sequencing costs using traditional 
Sanger capillary sequencing prohibited extensive 
large-scale studies using this approach. 

The opportunities to pursue the approach 
of assaying gene expression by comprehensive 
transcript sequencing have been changed 
dramatically by the recent development of 
so-called next-generation sequence technol¬ 
ogy. In particular, the next-generation sequenc¬ 
ing technologies that generate millions of 
short-sequence reads, such as the Illumina GA, 
ABI SoliD and Helicos sequence technologies, 
are increasingly being used to study gene 
expression (Mardis, 2008). Next-generation 


sequencing of porcine mRNA (referred to as 
RNAseq) has been used to increase the identi¬ 
fication of porcine transcripts and to provide 
further data to obtain the correct gene models 
for the genes in the porcine genome. To this 
end, a number of studies have used tissues 
from individuals that represent clones of the 
female pig (TJ Tabasco) whose genomic DNA 
was used for the genome assembly (see below). 
A large number of different tissues (Table 8.2) 
derived from 21 cloned individuals is available 
at the University of Illinois (contact L.B. 
Schook). RNAseq analyses were initially per¬ 
formed for tissues known to exhibit high levels 
of transcriptome complexity, i.e. the brain 
(J. Beever, personal communication; Uenishi et 
a/., 2009) and the placenta (M.A.M. Groenen, 
L.B. Schook, O. Madsen and R.RM.A. 
Crooijmans, unpublished results). In addition, 
RNAseq analyses have also been conducted for 
a number of tissues from pigs of other breeds, 
including testis tissues from a male wild boar 
(M.A.M. Groenen, L.B. Schook, O. Madsen 
and R.RM.A. Crooijmans, unpublished results) 
and muscle and liver tissues from a Danish 
Landrace pig (Hornshoj et a/, 2009). As a 


Table 8.2. Tissue samples of clones from TJ Tabasco, a Duroc sow from Illinois. Clones were derived from 
TJ Tabasco using somatic cell nuclear cloning (SCNC). Ear notch fibroblasts were collected and used as a 
source of nuclear material for SCNC. The resulting embryos were collected and used to generate fetal 
fibroblasts for use in future studies and to support potential genetic modifications such as knockout or 
knock-in studies. Fetal fibroblasts were also used to generate TJ Tabasco clones through SCNC that were 
collected at various stages of fetal development. These staged fetuses have also been used to create full 
length cDNA (complementary DNA) libraries for RefSeq (Reference Sequence) studies. 


Origin of tissue 

Number 

Origin of tissue 

Number 

Origin of tissue 

Number 

Adipose 

2 

Fetuses 

1 

Ovary 

9 

Adrenal glands 

3 

Fibroblasts 

12 

Pancreas 

4 

Bladder 

3 

Frontal lobe 

3 

Pituitary 

6 

Bone 

1 

Heart 

18 

Placenta 

4 

Bone marrow 

1 

Hippocampus 

3 

Pons 

3 

Brain 

11 

Hypothalamus 

6 

Skeleton 

8 

Bronchial nodes 

1 

Ileum 

4 

Skin 

9 

Cerebellum 

6 

Inguinal lymph node 

1 

Small intestine 

4 

Cerebral cortex 

3 

Intestine 

10 

Spinal cord 

1 

Colon 

7 

Jejunum 

1 

Spine 

1 

Colon fecal 

1 

Kidney 

18 

Spleen 

18 

Dental pulp 

1 

Liver 

18 

Stomach 

8 

Duodenum 

1 

Lung 

18 

Thalamus 

3 

Ear notch 

10 

Lymph nodes 

7 

Thymus 

1 

Eye 

5 

Mammary (gland) 

5 

Thyroid 

1 

Fat 

5 

Medulla 

2 

Trachea 

3 

Fetal liver 

1 

Muscle 

8 

Uterus 

15 
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further tool for future functional genomics stud¬ 
ies, and to improve gene models in the pig, 
several groups initiated sequencing of cDNA 
libraries that had been enriched for full-length 
cDNAs. These include a normalized full-length 
cDNA library constructed and sequenced from 
a pool of 11 different tissues (kidney, liver, 
lymph node, cerebellum, placenta, colon, 
hypothalamus, frontal lobe, spleen, small intes¬ 
tine and lung; M.A.M. Groenen, L.B. Schook 
and R.F.M.A. Crooijmans, unpublished results) 
and the sequencing of clones derived from 28 
full-length-enriched cDNA libraries from 25 
different porcine tissue and cell lines, including 
brain, ovary, colon and hypothalamus from 
clones of TJ Tabasco (Uenishi et a/., 2009). 

The availability of the sequence of full- 
length transcripts of the porcine genome will 
greatly facilitate the correct identification of the 
transcription start sites (TSS) of the porcine 
genes. This is not only extremely important to 
obtain correct gene models, but also for the pre¬ 
cise localization of the porcine promoters. The 
present algorithms designed to predict regula¬ 
tory elements within promoters have often 
proven unsatisfactory to a large extent because 
they assume correct identification of the TSS of 
the genes being compared, something that 
often is not the case. Although full-length cDNA 
sequences already provide the necessary infor¬ 
mation for correct assignment of TSS, this is 
further enhanced by the use of cap analysis 
gene expression (CAGE; Shiraki et a/., 2003). 
As for RNAseq, the combined use of CAGE and 
next-generation sequencing (also referred to as 
deepCAGE) adds a further dimension to the 
methodology, and in particular enables the iden¬ 
tification of less frequently used alternative pro¬ 
moters and tissue specific promoters (de Hoon 
and Hayashizaki, 2008). In pigs, deepCage has 
been done on placenta, testis (M.A.M. Groenen, 
L.B. Schook and R.RM.A. Crooijmans, unpub¬ 
lished results) and macrophages (D.A. Hume 
and A.L. Archibald, unpublished results). 


Non-coding and Regulatory RNAs 
in the Porcine Genome 

Transcripts that do not encode proteins are 
referred to as non-protein-coding or non-coding 
RNAs (ncRNAs). A key question arising from 


the observation of widespread transcription is 
whether these transcripts are biologically func¬ 
tional. Increasingly, several specific classes of 
ncRNAs have been shown to be involved in a 
wide spectrum of regulatory functions, and an 
increasing number of such ncRNAs are being 
discovered in the genomes of metazoans 
(Mattick, 2009). The best known and most 
studied class of ncRNAs are the microRNAs 
(miRNAs), which have been shown to be 
involved in the regulation of many genes. In 
addition, numerous other classes of short RNAs, 
such as Piwi protein-interacting RNA (piRNA) 
and small nucleolar RNA (snoRNA), and RNAs 
derived from the XIST locus on the X chromo¬ 
some (xiRNAs), have been described (Filipowicz 
et a/., 2008). Likewise, long regulatory inter- 
genic ncRNAs are increasingly being studied, 
although it is not yet clear to what extent these 
are functional (Louro et a/., 2008). Wernersson 
et al. (2005) analysed genomic sequence data 
representing an estimated 50% of the porcine 
genome for the presence of conserved miRNA 
sequences. By comparison with the sequences 
present in the miRNA hairpin database (Griffiths- 
Jones, 2004), a total of 51 mature miRNA 
sequences could be identified. To identify novel 
pig ncRNAs, rather than only ncRNAs that are 
conserved in other species, Seemann et al. 
(2007) constructed a bioinformatics pipeline, 
EST2ncRNA, and searched within the 1 million 
porcine ESTs for potential functional ncRNAs. 
Within the 48,000 EST contigs (contiguous sets 
of overlapping DNA segments) and 73,000 
singleton ESTs, they identified 1399 different 
potential ncRNAs, 137 of which were homolo¬ 
gous to known ncRNAs and a further 270 of 
which overlap with existing human ncRNA pre¬ 
dictions. Based on 92 different non-normalized 
cDNA libraries, the highest number of ncRNA 
predictions was derived from developmental 
and neuronal tissues. This high number does 
not appear to be caused by the complexity of 
the libraries, as only a small number of ncRNAs 
were observed in the testis, a tissue normally 
considered to be among the tissues with the 
highest different number of expressed genes. 

In a preliminary analysis of chromosomes 
7 and 14, at the time when sequence coverage 
of these two chromosomes exceeded 95%, 
over 850 potential miRNAs were identified, as 
well as an additional 3000 putative ncRNAs 
(J. Gorodkin and M. Fredholm, personal 
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communication). Although the false discovery choice for the construction of comprehensive 
rate of the ncRNA prediction programs used in genome-wide linkage maps during the 1990s. 
this analysis (rnaz and RNAmicro) is relatively In the years following the publication of the first 
high, this nevertheless provides a good indica- porcine linkage maps, the growth of such link- 
tion regarding the abundance of such sequences age maps has slowed. Nevertheless, the number 
in the porcine genome. The fact that these two of markers added to these maps has steadily 
chromosomes together comprise 231Mb of increased through the further mapping of addi- 
sequence indicates that the porcine genome tional new microsatellites and other types of 
would be predicted to contain over 37,000 mostly anonymous polymorphic DNA markers, 
ncRNAs and over 1000 miRNAs. including amplified fragment length polymor¬ 

phisms (AFLFs) (Rothschild, 2004) and single 
nucleotide polymorphisms (SNPs) (Vingborg 
et a/., 2009). Currently, over 5000 loci includ- 
Porcine Linkage Maps ing several hundred genes are located on the 

different maps (www.thearkdb.org). Gradually, 
The first coordinated efforts to better under- efforts to increase the number of markers on 
stand the pig genome focused on the genera- the porcine map have shifted towards the use 
tion of linkage maps based on polymorphic of physical maps such as the RH (radiation 
DNA markers. The major contributors to this hybrid panel) maps (Yerle et a/., 1998, 2002; 
effort have been international collaborative Hawken et a/., 1999) and the BAC maps (dis- 
projects based in Europe - the PiGMaP consor- cussed in the next section), 
tium (Archibald et a/., 1995) and the related 
Nordic collaboration (Ellegren et a/., 1994; 

Marklund et aI., 1996), and the efforts of the 

USDA Meat Animal Research Center (Rohrer ^ Highly Continuous BAC Map of the 
et a/., 1994, 1996). These combined efforts Porcine Genome 

resulted in the placement of over 1500 poly¬ 
morphic genetic markers on the porcine link- Physical maps based on bacterial artificial chro- 

age map. However, integration of all the linkage mosomes (BACs) have provided the essential 

information from the different studies into a framework for the majority of eukaryotic 

single consensus map was not very practical genomes that have been sequenced to date 

and has never been attempted. The majority of (Green, 2001). The stability of these clones, 

the markers on the pig linkage map are micro- their size and their relative ease of use in a 

satellite markers, short sequences comprising standard molecular biology laboratory have 

1-4bp direct repeats of at least eight copies, been key to their successful application for 

Because of the abundance of such sequences in physical mapping. Five different porcine BAC 

the genomes of vertebrates and many other libraries are available (Table 8.3) providing an 

eukaryotes, these have been the markers of estimated 38x coverage of the porcine genome 

Table 8.3. Fingerprinted porcine BAC (bacterial artificial chromosome) clones used for constructing the 
porcine BAC map (Humphray eta/., 2007).The genome coverage is calculated based on the estimated 
genome size of the porcine genome of 2.56 Gb and the average insert size of the BAC clones for the 
particular library. 


Library 

Fingerprinted clones 

Genome coverage 

Reference 

CHORI-242 

103,758 

6.7 

http://bacpac.chori.org/ 




library.php?id=124 

RPCI-44 

61,281 

3.8 

Fahrenkrug etal., 2001 

PigE 

73,866 

4.2 

Anderson etal., 2000 

INRA 

28,467 

1.5 

Rogel-Gaillard etal., 1999 

KPN 

361 

0.02 

Jeon etal., 2003 

Other 

151 

0.01 

— 

Total 

267,884 

16.2 

Humphray et a!., 2007 
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(Rogel-Gaillard et al., 1999; Anderson et a/., 
2000; Suzuki et a/., 2000; Fahrenkrug et al., 
2001; Jeon et al., 2003). The development of 
a porcine BAC contig physical map by means 
of fingerprinting the individual clones (Schein 
et al., 2004) from two BAC libraries (RPCI-44 
and CHORI-242) both produced by Pieter J. 
de Jong, one made at the Roslin Institute 
(Anderson et al., 2000), and one produced at 
INRA (Institut National de la Recherche 
Agronomique) (Rogel-Gaillard et al., 1999), 
was undertaken through a coordinated interna¬ 
tional effort as a precursor to the pig genome 
sequencing project. The total number of BACs 
fingerprinted was 267,884, representing 
16.2x depth of the porcine genome based on 
its estimated size of 2.6-2.7 Gb (Schmitz et al., 
1992; Rogatcheva et al., 2008). The finger¬ 
printed BACs were assembled into 172 contigs 
covering an estimated 98% of the porcine 
genome (Fig. 8.1) (Humphray et al., 2007). 
This BAC fingerprint map constitutes the most 
highly contiguous BAC map of any mamma¬ 
lian genome constructed so far, with one single 
complete chromosome (Sscl3) represented by 
a single contig. The map is accessible through 
the Sanger web site (http://www.sanger.ac. 
uk/Proj ects/S_scrof a/ mapping. shtml). 

To enable the integration of the BAC 
and genome sequence maps, a total of 
620,089 BAC end sequences (BES) were 
generated from 335,463 BACs with an aver¬ 
age Q20 length of 635bp. These sequences 
comprised approximately 15% of the porcine 
genome and enabled the construction of a 
detailed human-porcine comparative map 
(Humphray et al., 2007; and discussed 
below). 


Genome Sequencing and Sequence 

Assembly 

The first region of the pig genome subjected to 
systematic sequencing was the major histo¬ 
compatibility complex (MHC) region around 
the centromere on chromosome 7 (SSC7). 
Initial sequencing concentrated on regions 
containing the classical MHC class I genes 
SLA1, 2, 3, 4, 5, 9 and 11 (Renard et al., 
2001). A contig consisting of 15 BACs derived 


from the INRA BAC library (Rogel-Gaillard et 
al., 1999) was sequenced, resulting in a con¬ 
tiguous sequence of 307 kb in which 11 genes 
were identified. Subsequent sequencing of 
those regions flanking this contig resulted in 
a further 670 kb of porcine MHC region 
sequence and the identification of an additional 
36 genes (Shigenari et al., 2004, Ando et al., 
2005). The sequence of the complete porcine 
MHC region on both sides of the centromere 
on chromosome 7 containing the class I, II and 
III MHC genes was published in 2006 (Renard 
et al., 2006). This 2.4Mb sized region, exclud¬ 
ing the centromere, contains 151 genes, of 
which 123 could be identified as orthologous 
to human MHC genes. 

The Sino-Danish pig sequencing consor¬ 
tium, although primarily focusing on sequenc¬ 
ing large numbers of ESTs (Gorodkin et al., 
2007), generated 3.84 million shotgun 
sequences derived from five different pig 
breeds: Hampshire, Yorkshire, Landrace, 
Duroc and Erhualian (Wernersson et al., 
2005). The number of sequences per breed 
varied from 257,000 for the Chinese Erhualian 
breed to 1.2 million for the Yorkshire (Large 
White) breed. The 3.84 million sequences rep¬ 
resent an estimated 0.66x coverage of the 
porcine genome. The low coverage and high 
diversity of the animal material used has pre¬ 
vented any meaningful assembly of the 
sequences, and the data are primarily a 
resource for SNF discovery (discussed below). 
Based on these sequences, the repetitive 
sequence content of the porcine genome was 
estimated to be around 34% (Table 8.4), which 
is similar to that of the mouse but lower than 
that of other mammalian genomes, such as the 
human, dog and cow. This is consistent with 
the smaller size of the porcine genome of 2.56 
billion bp as estimated from build 9 (July 2009, 
discussed below). There seems to be a clear 
correlation between the estimated genome size 
and the repeat content (Table 8.4), with the 
exception of the dog genome, which, from the 
mammals whose genome has been sequenced, 
is the smallest genome but has a repeat content 
similar to that observed in humans. Like the 
cow genome, the pig genome seems to have a 
relatively low number of LTR (long terminal 
repeat) elements compared with all the other 
mammals, as well as a relatively low number of 
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Fig. 8.1. Bacterial artificial chromosome (BAC) map of the porcine genome. The positions of the BAC 
contigs (contiguous sets of overlapping DNA segments) are indicated by vertical bars bars adjacent to 
the karyotype of the individual porcine chromosomes. 


long interspersed elements (LINEs), a feature 
that the pig shares with the mouse. 

The integrated highly contiguous physical 
map of the pig genome (Humphray et ai, 
2007) was used as a template for sequencing 
the porcine genome. Sequencing was primarily 
undertaken at the Wellcome Trust Sanger 
Institute at Hinxton, UK, with a hierarchical 


shotgun sequencing approach using the BAC 
clones from the BAC map. In order to minimize 
the number of BACs required to cover the com¬ 
plete porcine genome, BACs were identified in 
a series of iterative rounds. Briefly, pairs of 
BACs selected from the minimal tile path (i.e. 
the path through the overlapping clones in the 
physical map that represents greatest genome 
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Table 8.4. Comparison of genome size and repeat content of sequenced mammalian genomes. 

The repeat count for the pig is based on 0.66x genome coverage (Wernersson et al., 2005). Short 
interspersed elements (SINEs) and long interspersed elements (LINEs) together with the long terminal 
repeats (LTRs) are three different classes of repetitive elements that all transpose through an RNA 
intermediate (retrotransposition) as opposed to other types of transposons that transpose directly as 
DNA (i.e. DNA elements). Satellite repeats are a class of repetitive sequences mostly found within 
centromeric and pericentromeric regions. Simple repeats and low complexity repetitive sequences are 
mainly microsatellite and minisatellite repeats. 



Dog 

Mouse 

Pig 

Rat 

Human 

Cow 

SINEs 

7.96 

7.63 

11.3 

7.78 

13.14 

17.66 

LINES 

19.54 

16.46 

16.14 

20.1 

20.42 

23.29 

LTR elements 

10.39 

8.72 

2.8 

10.28 

8.29 

3.2 

DNA elements 

0.88 

0.36 

1.51 

0.86 

2.84 

1.96 

Unclassified 

0.32 

0.37 

0 

0.37 

0.14 

na 

Small RNA 

0.06 

0.04 

0.02 

0.03 

na 

na 

Satellite repeats 

0.04 

na 

1.47 

0.31 

na 

na 

Simple repeats 

2.39 

na 

0.62 

2.41 

na 

2.27 

Low complexity repetitive 

0.73 

na 

0.53 

na 

na 

na 

sequences 

Total 

42.31 

33.58 

34.39 

42.14 

44.83 

48.38 

Genome size 

2.45 

2.5 

2.56 

2.75 

2.85 

2.87 


na, not available. 


coverage in the smallest number of clones) and 
at multiple dispersed locations in the genome 
were sequenced; the sequence contigs from this 
first wave of sequenced BAC clones were used 
to identify a second wave of BAC clones from 
the minimal tile path exploiting the BAC end 
sequence data to confirm and minimize the 
overlaps between clones sequenced in the first 
and second waves. This selection process was 
repeated iteratively to extend and close gaps in 
the sequence map. The initial aim was to obtain 
a 4x sequence depth across the genome through 
a minimal tile path BAC-by-BAC approach (i.e. 
sequencing each BAC clone in turn and inde¬ 
pendently), with clones being preferentially 
selected from the CHORI-242 BAC library 
which had been generated from a single Duroc 
sow (TJ Tabasco). To further minimize the 
number of clones needed to cover the complete 
genome, a fosmid library was produced using 
TJ Tabasco DNA with an average insert size of 
40 kb. End sequences were obtained for the fos¬ 
mid clones in order to align these clones with 
the emerging sequence map. Fosmid clones 
were used to bridge the remaining small gaps in 
the genome sequence. In earlier releases of the 
porcine genome, the 2.4Mb sequence of the 
MHC region (Renard et a/., 2006) was incorp¬ 


orated, but in the latest release this sequence 
has been replaced by sequences derived from 
CHORI-242 clones. 

The current assembly (at the time of 
writing) of the draft pig genome sequence (build 
9; Sscrofa9) is accessible in the Ensembl 
genome browser (http://www.ensembl.org/ 
Sus_scrofa/Info/Index). This assembly was 
established from the BAC clone derived 
sequences as available in April 2009, and cov¬ 
ers about 89% of the pig genome. The Ensembl 
team established the first gene build for the pig 
as follows: (i) starting with 9277 pig proteins 
from RefSeq (the NCBI Reference Sequence 
database, excluding all the predicted models) 
and UniFrot (Universal Protein Resource) 
sequences of which 7144 aligned uniquely; (ii) 
and also starting with 19,384 pig cDNA 
sequences (after predicted cDNAs were 
removed), of which 11,930 cDNAs met the 
criteria (i.e. aligned with identity >97% and 
coverage >90%); (iii) from 1,532,435 pig ESTs 
of which 898,859 ESTs passed the same score 
cut-off (i.e. aligned with identity >97% and cov¬ 
erage >90%); and (iv) -130,000 additional 
proteins, mostly from other mammals, and 
-20,000 human Ensembl models, of which 
around 50% aligned with >90% coverage. All 
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the gene predictions were merged, giving pri¬ 
ority to pig-specific proteins to give a final gene 
set of 17,493 genes and 520 pseudogenes. 

The next assembly (SscrofalO), which will 
form the basis for the publication of a draft pig 
genome sequence, will incorporate not only 
sequence data from BAC clones that extend the 
coverage of the genome, but also whole genome 
shotgun sequence data generated by the Korea 
National Livestock Research Institute. The por¬ 
cine genome will be further improved by the 
incorporation of whole genome shotgun reads 
representing 24-fold genome coverage and 
derived from TJ Tabasco, generated using next- 
generation sequencing performed by the Beijing 
Genome Institute (BGI) using the Illumina GA 
sequencing platform, and consisting of 44-bp 
paired end reads totalling 66.6Gb of sequence 
data (Jun Wang, personal communication). 


Comparative Genomics 

As the number of genes mapped across the 
genomes of different species increased in the 
late 1980s, it quickly became apparent that 
the homologues of genes that co-localized on 
the same chromosome in one species were 
often also co-localized in other species; this 
phenomenon was referred to as ‘conserved 
synteny’ (Nadeau, 1989). Conserved syntenies 
were defined as homologous segments in dif¬ 
ferent organisms composed of at least two 
pairs of homologous genes located on the 
same chromosome, regardless of gene order. 
As the number of mapped genes increased fur¬ 
ther and, in particular, after the characteriza¬ 
tion of the complete genome sequence of 
multiple species, the definitions ‘conserved 
synteny block’ and ‘conserved synteny seg¬ 
ment’ were often used (Waterston et a/., 2002), 
although this nomenclature has not been used 
uniformly in genome sequencing papers pub¬ 
lished in the last decade. Within this chapter, 
the term ‘conserved block’ is used for regions 
that are on the same chromosome between 
species (e.g. pig chromosome 8 and human 
chromosome 4) and that, at the resolution 
used, are not interrupted by regions homolo¬ 
gous to other chromosomes. Where the 
homologous sequences and/or genes are in 


the same order in the two species, they are 
referred to as ‘conserved segments’. 

Conserved synteny between the porcine 
and other mammalian genomes, in particular 
that of humans, has already been used for 
almost 20 years to predict the location of 
genes and to identify candidate genes for 
important traits in the pig. The first example 
where this approach was used successfully 
was the identification of the RYR1 gene as 
the gene for the halothane locus on porcine 
chromosome 6 (MacLennan et a/., 1990; 
Fujii et a/., 1991; Otsu et a/., 1991). Other 
well-known examples where comparative 
mapping was successfully used to identify the 
candidate gene for the trait under investiga¬ 
tion in the pig include the identification of a 
mutation in the PRKAG3 gene (RN locus) 
responsible for the excess glycogen content in 
pig skeletal muscle (Milan et a/., 2000), and 
the identification of an SNP in the IGF2 gene 
as the causal variation underlying an imprinted 
quantitative trait locus (QTL) for backfat and 
muscle growth on porcine chromosome 2 
(Van Laere et a/., 2003). 

The development of a porcine-human 
comparative map accelerated with the 
increased efforts to map genes and ESTs 
(Fridolfsson et a/., 1997; Wintero et a/., 
1998; Rink et a/., 2002) on the porcine link¬ 
age maps (Ellegren et a/., 1993; Johansson 
et a/., 1995) and RH maps (Hawken et a/., 
1999; Robic et a/., 1999; Lahbib-Mansais 
et a/., 2000). The first comprehensive com¬ 
parative maps between the porcine and 
human genomes were obtained by bidirec¬ 
tional chromosome painting by means of 
fluorescent in situ hybridization using indi¬ 
vidual flow-sorted chromosomes (Rettenberger 
et a/., 1995; Goureau et a/., 1996). These 
results revealed the presence of at least 37 
conserved synteny blocks, which was some¬ 
what lower than observed for the bovine- 
human comparative maps (Hayes, 1995; 
Solinas-Toldo et a/., 1995). Although orthol- 
ogous genes mapped in both humans and 
pigs showed that several of these blocks 
consisted of multiple segments, the mapping 
resolution available at that time did not permit 
estimates regarding the number of conserved 
synteny segments between the human and 
porcine genomes. The first high-resolution 
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porcine-human comparative map that was 
able to identify conserved synteny segments 
within these larger conserved synteny blocks 
was derived from the RH mapping of 1058 
ESTs (Rinket a/., 2002). Using this approach, 
Rink et al. (2002) were able to identify at 
least 60 evolutionary break-points and 90 
micro-rearrangements between the genomes 
of humans and pigs. The availability of a 
high-resolution physical map based on finger¬ 
printed BACs (Humphray et al., 2007), and 
in particular the availability of the end 
sequences (BES) of many of these BACs, 
allowed the development of even higher reso¬ 
lution human-porcine comparative maps 
(Meyers et al., 2005; Humphray et al., 
2007). Meyers et al. (2005) used these 
resources to add 2068 BES to the RH map, 
thus further refining the resolution of the 
comparative map; they were able to identify 
51 conserved synteny groups and 173 con¬ 
served synteny segments between the 
genomes of humans and pigs. Using the def¬ 
inition of conserved synteny blocks presented 
here, the total number of conserved synteny 
blocks reported in that study is 65. Completion 
of the porcine genome sequence will further 
increase the resolution of the comparative 
map between the human and the pig. 
Comparison of the human genome sequence 
with the currently available pig genome 
sequence (build 9; Sscrofa9), which covers 
approximately 89% of the porcine genome, 
has been further extended by searching 
Sscrofa9 with 10 kb segments of the human 
genome sequence using the algorithm blat 
(blast-like alignment tool; Kent, 2002). The 
resulting comparative map (Plate 3) reveals 
additional evolutionary break-points as well 
as an additional number of (small) conserved 
synteny blocks not observed at previous reso¬ 
lutions, bringing the total number of con¬ 
served synteny blocks to 70 and the number 
of conserved synteny segments to 194. 
Because comparing the next pig genome 
assembly (SscrofalO) against other maps, 
including the linkage, RH and comparative 
maps, will form part of the quality checks on 
the draft pig genome sequence before its 
publication, the putative evolutionary break¬ 
points revealed by these analyses will be 
regions that merit careful checking. However, 


the modest number of conserved synteny 
segments suggests that the current assembly 
of the genome sequence data (i.e. build 9; 
Sscrofa9) is a good assembly. 

Variation in the Porcine Genome 

Establishing the complete sequence of the 
genome of any given species is extremely 
important as it allows the analysis of the com¬ 
plete gene content of that organism, thus facili¬ 
tating the dissection of the molecular basis of 
all aspects of the functioning of that particular 
species. Comparing the gene content and the 
evolution of genes and gene families between 
closely as well as distantly related species pro¬ 
vides further insights into understanding the 
molecular instructions that contribute to the 
development and functioning of a given organ¬ 
ism. Equally important is the characterization 
of the natural variation between the genomes 
of different individuals within a species. Genetic 
variation is central to the variation observed in 
traits within any given population, as well as a 
key that allows specific individuals to adapt to 
changes in the environment and eventually 
supports the emergence of new species. 
Furthermore, in pig breeding, the available 
genetic variation within the different pig popu¬ 
lations has allowed the development of specific 
breeds and lines, each with specific character¬ 
istics (traits), and supplies the raw material 
from which further improvement in productiv¬ 
ity, health and welfare can be built by the 
breeding industry. In this respect, the establish¬ 
ment of the genome sequence of a single indi¬ 
vidual is just the start, but it provides the 
necessary framework and reference to further 
examine the organization of the genome of a 
large number of individuals. 

Variation within the genome involves 
changes of single nucleotides (SNFs), varia¬ 
tion of repetitive sequences, e.g. at mini- and 
micro-satellites, and even variation in the 
numbers of regulatory sequences and genes 
(copy number variation or CNV). Some of this 
variation has already been used in the past to 
develop polymorphic markers to construct 
the necessary linkage maps or to study spe¬ 
cific genetic variation (Chapters 2-5). Probably 
the first large-scale identification of genetic 
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variation in the pig at a genomic scale was the 
development of large numbers of microsatel¬ 
lite markers that were used to construct link¬ 
age maps of all the pig chromosomes in the 
1990s (Rohrer et a/., 1994, 1996; Archibald 
et ai, 1995). Although useful as genetic 
markers, microsatellites, in general, do not 
contribute greatly to phenotypic variation. 
The majority of microsatellite markers devel¬ 
oped in the pig were based on the (CA) n motif, 
the most frequent type of microsatellite found 
in vertebrate genomes. The frequency of (CA) n 
microsatellites was found to be similar in the 
porcine and human genomes, with approxi¬ 
mately one every 40 kb for (CA) n microsatellite 
loci with more than 12 repeats (van Wijk 
et a/., 2007). These results were confirmed by 
a count based on genome build 9, represent¬ 
ing 89% of the genome (Table 8.5). 

Over the last 5-10 years, as a direct result 
of the improved automation of SNF genotyp- 
ing and the abundance of this type of marker, 
the emphasis in genetic studies has quickly 
shifted towards the identification and use of 
SNFs. Furthermore, SNP variation is thought 
to underlie most of the observed phenotypic 


variation, providing an even stronger stimulus 
for the discovery of this type of variation. 
Numerous studies have focused on the identi¬ 
fication of SNPs in specific genes, particularly 
in relation to candidate gene approaches for 
the analysis of quantitative traits (Rothschild 
et a/., 2007). The first studies to systemati¬ 
cally identify SNPs at a large scale used 
sequencing of PCR amplified fragments and 
focused on porcine genes (Fahrenkrug et a/., 

2002) , and a QTL region identified on the 
short arm of chromosome 2 (Jungerius et a/., 

2003) . The estimated SNP frequencies in the 
haploid genome based on these studies were 
one SNP every 609bp (Fahrenkrug et a/., 
2002) and one SNP every 357 bp (Jungerius 
et a/., 2003), respectively. The SNP frequency 
of one SNP every 357 bp derived from ran¬ 
dom genomic sequences is twofold higher 
than that found in the genome of Bos taurus 
(The Bovine HapMap Consortium, 2009) and 
twofold lower than in the chicken genome 
(Wong et ai, 2004). 

The large-scale EST sequence data have 
also been used for SNP mining (Uenishi et a/., 
2004, 2007; Panitz et ai, 2007; Vingborg et a/., 


Table 8.5. Dinucleotide microsatellite count per chromosome based on pig genome assembly build 9 
(Sscrofa9). Summary of distribution of TG, TA and TC type microsatellite repeats in the porcine genome. The 
TG type of repeat underlies the majority of the microsatellite markers used for linkage mapping in the pig. 


Chromosome no. 

Chromosome size (bp) 

TG>12 

TC>12 

TA>12 

1 

295,534,758 

6,342 

712 

1,977 

2 

140,138,545 

3,123 

344 

829 

3 

123,604,833 

3,071 

377 

621 

4 

136,259,999 

3,240 

388 

893 

5 

100,522,023 

2,351 

277 

611 

6 

123,310,224 

3,021 

368 

643 

7 

136,414,115 

3,266 

403 

803 

8 

119,990,724 

2,692 

301 

809 

9 

132,473,644 

3,094 

334 

702 

10 

66,741,983 

1,664 

195 

346 

11 

79,819,449 

1,847 

170 

433 

12 

57,436,398 

1,340 

146 

232 

13 

145,240,356 

3,187 

363 

929 

14 

148,515,193 

3,611 

441 

1,028 

15 

134,546,158 

3,029 

358 

843 

16 

77,440,712 

1,873 

190 

435 

17 

64,400,393 

1,575 

212 

426 

18 

54,314,914 

1,289 

107 

278 

19 

125,876,345 

3,066 

452 

1,144 

Total 

2,136,704,421 

52,681 

6,138 

13,982 
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2009). The majority of these SNFs, however, 
have not been deposited into NCBI’s SNP data¬ 
base (dbSNP), although they are available 
through a number of EST specific databases. 
Panitz et al. (2007) described the identification 
of 7900 candidate SNPs using a data set of over 
0.8 million ESTs (Gorodkin et al., 2007). 
Around 3900 of these SNPs were included on a 
7K Illumina iSelect beadchip (C. Bendixen and 
A. Archibald, personal communication), and 
genotyping results on a wide variety of breeds 
indicated a validation frequency of 83%. Similar 
validation frequencies were observed for SNPs 
derived from the PEDE (Pig Expression Data 
Explorer) and TGI (The Gene Index Project at 
the Dana Farber Cancer Institute) databases 
(M.A.M. Groenen, unpublished results). SNP 
identification based on a comparison of all the 
available porcine genomic sequences in 
GenBank (the US National Institutes of Health 
genetic sequences database, available at NCBI) 
resulted in the identification of 6374 SNPs, for 
which both variants were observed at least twice 
(Kerstens et a/., 2009). Similar to the putative 
SNPs derived from the EST sequence data, the 
conversion rate (i.e. the proportion of the puta¬ 
tive SNPs that could be validated and shown to 
be truly polymorphic) of these SNPs was 82%. 

As EST data are known to also contain 
mitochondrial DNA (mtDNA) sequences, this 
resource was examined for any variation in 
mtDNA-derived sequences (Scheibye-Alsing 
et a/., 2008). Although the authors described 
374 putative SNPs, their validation results indi¬ 
cate that the majority were false positives, and 
that the number of true, reliable mtSNPs with 
high conversion rates that were identified was 
112. Conversion rates for SNPs obtained from 
PCR-based re-sequencing efforts based on BES 
(A. Archibald, unpublished results) or from the 
sequence comparison of high-quality sequenced 
genome sequences (Amaral et al., 2008) were 
generally significantly higher (>95%). More 
recently, several large SNP discovery projects 
were initiated, each of which used a different 
sequence methodology and strategy. Denis 
Milan and co-workers (INRA, France) used tra¬ 
ditional Sanger sequencing to generate a total 
of 1 million sequences of Large White, 
Landrace, Pietrain, Iberic, Gottingen, Meishan 
and Wild Boar. In total, over 55,000 high- 
quality SNPs were identified with a conversion 


rate above 95% (D. Milan, personal communi¬ 
cation). Using next-generation sequencing on a 
Roche GS-FLX sequencer, Wiedmann et al. 
(2008) identified more than 100,000 SNPs 
with a conversion rate of more than 91%. The 
other two studies used Illumina’s GA next- 
generation sequencing technology to sequence 
reduced representation libraries (Van Tassell 
et al., 2008) prepared from pools of different 
individuals (Amaral et al., 2009; Ramos et al., 
2009). This resulted in over 390,000 SNPs, 
including estimates of the minor allele frequen¬ 
cies for these SNPs. Currently, 541,144 of the 
SNPs discovered in the pig have been submitted 
to dbSNP, representing around half a million 
unique SNPs. This resource of SNPs (Fig. 8.2) 
was used for the design of a porcine 60K 
Illumina Beadchip (Ramos et al., 2009), which 
is being used extensively worldwide within 
industry and academia for genomic selection, 
whole genome association studies, as well as 
within a large international porcine HapMap 
study. Recently, sequencing the complete 
genome of an individual boar to 26x using 
Illumina GAII next-generation sequencing 
resulted in an additional 2-3 million SNPs 
(Zhan et al., 2009). Furthermore, further 
sequencing of other breeds, as well as of indi¬ 
vidual pigs from different breeds, is expected to 
result in a resource of over 10 million porcine 
SNPs in the very near future. 

Although, SNPs have attracted most of the 
attention in respect of the genetic variation 
underlying phenotypic variation, over the past 
4-5 years it has become apparent that vertebrate 
genomes exhibit another type of variation. In 
addition to changes of a single base pair (SNPs), 
it is now clear that vertebrate genomes harbour a 
large number of structural variants (SV), including 
inversions, translocations, deletions and 
insertions. To date, such SVs have been most 
extensively studied in the human genome (for a 
recent review see Frazer et al., 2009) and the 
focus is often in particular on insertions and 
deletions (generally referred to as copy number 
variation, or CNV). In humans, it is estimated 
that up to 30% of the genome is affected by this 
type of variation. Currently, more than 38,000 
SVs have been identified in the human genome 
(http://projects.tcag.ca/variation). An initial 
CNV analysis of porcine chromosomes 4, 7, 14 
and 17 (Fadista et al., 2008) using array 
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Fig. 8.2. Distribution of the single nucleotide polymorphisms (SNPs) present on the 60K Beadchip. Black 
indicates the distribution based on pig genome assembly build 7 (Sscrofa7), available at the time of the 
chip design. Grey shows the distribution of the SNPs based on genome build 9 (Sscrofa9, covering 89% 
of the porcine genome). The x-axis represents the distance between the SNPs in bins of 5 kb. 


comparative genome hybridization with a probe 
spacing of 409 bp identified 37 CNVs with a 
size range of 2-62 kbp. Although these results 
clearly show the abundance of SVs in the 
porcine genome, a true comparison of the 
frequency of SVs in the pig with that in man has 
to await more systematic studies addressing the 
complete porcine genome. 


The Future of Porcine Genomics 

Our knowledge of the structure, function and 
variability of complex genomes, and the tools 
to further analyse genomes seem to be chang¬ 
ing at an ever increasing pace. While we are 
writing this chapter, pig genome assembly build 
10 (SscrofalO), representing around 98% of 
the porcine genome, is already around the cor¬ 
ner, which will further improve our knowledge 
of this important and fascinating mammal. 
Furthermore, it is likely that through the recent 
developments of next-generation sequencing 
technologies, within the next couple of years 


we will obtain the sequence of tens to hundreds 
of individual pigs from different breeds, thus 
providing further detailed insight into the 
genetics of this species. Many farm animals 
have seen their genomes sequenced before 
that of the pig (Hillier et a/., 2004; The Bovine 
Genome Sequencing and Analysis Consortium 
et a/., 2009; Wade et a/., 2009). What makes 
the porcine genome sequencing project dis¬ 
tinctive from recent genome projects is the fact 
that sequencing has been based on a directed 
approach of sequencing BACs ordered in a 
highly contiguous physical map. The resulting 
high-quality genome sequence allows for a 
detailed analysis of segmental duplications, 
rearrangements and SVs to an extent not fea¬ 
sible in many genomes whose sequences are 
based on whole-genome shotgun sequence 
approaches. With this resource at hand in 
2010, the further comparison of the genomes 
of additional pigs from a variety of breeds will 
provide a treasure trove not only to understand 
the genetic basis of important quantitative 
traits, but also to study further aspects related 
to speciation, domestication and selection. 
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Note Added in Proof 

In August 2010 the Swine Genome Sequencing 
Consortium announced the completion of a 


draft reference genome sequence for the pig 
(Sscrofa 10) and has published its plans for 
the analysis and publication of the sequence 
(Archibald et al., 2010). 
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Introduction 

McGlone et a/. (1998) wrote the first version 
of ‘Behaviour Genetics of the Domestic Pig’, 
which covered feeding, sexual, social, stress 
and abnormal behaviours. The aim of this 
chapter will be to update those areas and intro¬ 
duce new areas of behavioural genetics and the 
link to swine well-being, behavioural traits that 
may be useful for inclusion in a breeding pro¬ 
gramme and a detailed review of the genetics 
of maternal and aggressive behaviours. 

The wild pigs of Asia and Europe were 
selected for domestication in large part because 
of their behaviour. While there was consider¬ 
able variation in behaviour within the species, 
the behavioural traits that favoured domestica¬ 
tion included their omnivorous dietary needs, 
their medium body size, their typically (though 
not always) docile nature, their relatively weak 
maternal-neonatal bonds, their precocial 
nature and their general adaptability (Hale, 
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1969; Ratner and Boice, 1976). The domestic 
pig was selected from native strains of wild pigs 
in Europe and Asia. The wild ancestors, the 
oldest breeds of domesticated European 
pigs and common modern-day domesticated 
pigs share many behavioural characteristics 
(Mitichashvili eta/., 1991). Extensive research 
has been carried out to study the natural behav¬ 
iour of domesticated pigs and, in many cases, 
this has shown that fundamental aspects of 
behaviour differ only marginally from that of 
the wild ancestors with regard to their social, 
ingestive and exploratory behaviours (Jensen 
et a/., 2008). Different though from its wild 
ancestors, the domestic pig has been selected 
to be more calm, quiet and less active than wild 
pigs (Robert et a/., 1987). The very plasticity 
of the domestic pig allows it to grow and repro¬ 
duce in less-than-ideal, and often variable, 
environments. To survive and prosper requires 
a plastic and forgiving genome and, therefore, 
combining this plasticity with the modern 
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production environment will have ramifications 
on pig adaptability, which can then affect 
production, health and overall well-being. 

Behaviour is an integral part of biological 
regulation, and an important factor in swine 
production (Mormede, 2005). In turn, the 
study of behaviour genetics explores the very 
nature of a species. We know that species are 
distinguished from each other both by their 
morphology and by their behaviour. Thus, 
while sheep are flocking animals, pigs tend to 
travel in small herds. Ruminants forage, while 
pigs scavenge as they forage. Intense selection 
may not make the scavenger a grazer, but it 
might increase its rate of feed intake and thus 
its rate of scavenging or feeding. Within a spe¬ 
cies, and within the behavioural types of that 
species, selected breeds or strains of animals 
often have unique behavioural traits that are 
heritable, or more commonly referred to as 
breeding true (like the sheepdog versus the lap- 
dog). Pig behaviours vary among genetic lines 
within the species, some are lethargic, some 
hyperactive and many are in the middle of this 
range. The objectives of this review are to 
detail what is known today on pig behavioural 
genetics and to provide direction for further 
investigations. 


Behavioural Genetics and the Link 
to Swine Well-being 

The movement of pigs indoors has addressed 
some well-being concerns (Barnett et a/., 2001), 
while creating others (Newman, 1994). Under 
these intensive conditions, behavioural charac¬ 
teristics may be: (i) manifestations of the pigs’ 
biological and psychological reactions to change 
or uniformity of environment; (ii) instinctual 
behaviours that are sources of difficulty under 
certain restrictive systems; or (iii) behaviours that 
are no longer necessary in the new environ¬ 
ment. From Newman’s (1994) perspective, the 
genetics of behaviour may, in turn, provide clues 
to the well-being of the pig. To date, three main 
strategies have been used for the genetic 
improvement of swine: selection among breeds 
or strains, selection within breeds or strains, and 
crossbreeding. These breeding strategies have 
predominantly focused on production traits/ 


outcomes or economically important traits 
(Falconer, 1990), but production (pigs per sow 
per year, body weight and pre-weaning mortal¬ 
ity), physiology (blood profiles/immune status), 
anatomy (gait, bone density) and health are all 
interwoven, albeit directly or indirectly, with the 
behavioural repertoire of the pig. Moreover, 
there is more than just economic importance; 
there is concern about the undesired side effects 
of selection for high production efficiency and 
its association with swine well-being. Pig breed¬ 
ing organizations may need to balance these 
economically important traits with non-economic 
values such as emotional and societal issues 
(Kanis et a/., 2005). 

A new tool that could be used for many 
aspects of well-being is determining the molec¬ 
ular architecture of behaviour via quantitative 
trait locus (QTL) analyses. Knowledge of the 
molecular mechanism of behaviour might help 
to improve our understanding of behavioural 
problems such as aggression, environmental 
adaptation and overall swine well-being. 
Although there are limited QTL behavioural 
data for pigs, the Pig QTL database (PigQTLdb) 
and its peripheral tools make it possible to 
compare, confirm and locate on pig chromo¬ 
somes the most feasible location for the genes 
responsible for quantitative traits important to 
pig behaviour (PigQTLdb, 2010). A recent 
study by Reiner et ai (2009) highlights the 
importance of this tool and its possible future 
applications. These authors mapped QTLs for 
a variety of behavioural indices in swine under 
healthy conditions and after infection with 
Sarcocystis miescheriana. The study was con¬ 
ducted using a Pietrain x Meishan F 2 family 
and six QTL-controlled behavioural indicators 
under healthy conditions, and another six, 
eight and nine QTL-influenced behavioural 
indicators during acute, subclinical and chronic 
disease, respectively. For example, time spent 
rooting was influenced by only two QTLs on 
pig (Sus scrofa , SCC) chromosomes 7 and 9 
(SSC7 and 9), and solely during reconvales¬ 
cence from acute disease (on day 28); Pietrain 
alleles were associated with higher activities. 
Two QTLs on SSC7 and 8 showed effects on 
walking activity, but exclusively before infec¬ 
tion. Time spent performing social behaviour 
was influenced by one, two and two QTLs, 
before infection, during reconvalescence and 
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during chronic disease, respectively. QTLs 
were located on SSC6, 7, 11, 12 and 13 and 
explained up to 20.4% of F2-phenotypic vari¬ 
ance. Therefore, the authors concluded that 
behavioural indicators of pigs under healthy 
conditions and during subclinical, acute and 
chronic disease are influenced by multiple 
genes, and they suggested that future searches 
for QTL behavioural indices will be based on 
the founder swine breeds and on an environ¬ 
ment that allows the expression of different 
behavioural patterns. 


Behavioural Traits that may be 
Useful for Inclusion in a Breeding 

Programme 

Behavioural genetics is a fascinating and 
important area of research, fascinating because 
of the complex and rich behavioural repertoire 
of the pig, and important because of its bearing 
on swine well-being and productivity (Rydhmer, 
2005). Although the field of behaviour genet¬ 
ics has been around for several decades (Siegel, 
1976), and the behaviour genetics of farm ani¬ 
mals has been reviewed from time to time to 
synthesize the handful of literature (Siegel, 
1976; Hohenboken, 1986, 1987) before the 
mid-1990s, relatively few investigations have 
been reported that seek to better understand 
the genetic basis of pig behaviours. This may 
be due to numerous behaviour studies requir¬ 
ing an enormous time commitment. However, 
with increased automation for collecting behav¬ 
ioural data (Hyun et a/., 1997; Musial et a/., 
1999; Meiszberg et a/., 2009), more peer- 
reviewed papers have now been published, 
mainly addressing feeding, aggressive and 
maternal behaviours of the pig as they relate to 
behaviour genetics. Ranking behaviours of the 
pig that may in turn be added to current breed¬ 
ing value programmes continue to be discussed, 
although several stumbling blocks prevail. 
Mormede (2005) elegantly summarized these 
challenges. First, in pigs only a few genetic 
lines are available that have been selected for 
behavioural traits. Secondly, within-breed vari¬ 
ation is usually large and this important back¬ 
ground noise reduces the influences of 
individual loci. Thirdly, and perhaps a major 
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limit to behavioural changes in a breeding pro¬ 
gramme, is the limited basic knowledge about 
the psychobiological dimensions underlying 
behavioural trait variability, and the limited 
availability of reliable and meaningful measures 
of these that would be as free from environ¬ 
mental influences as possible. Fourthly, Jensen 
et al. (2008) noted that the age of the pig at 
which the study is performed may affect the 
trait to be selected, and that behavioural traits 
measured in young and adult pigs can have dif¬ 
ferent underlying motivations, which means 
that they are basically different traits. Further 
challenges identified are how these behavioural 
measures can be collected objectively and accu¬ 
rately, and how repeatable they are over time. 
A final challenge is that the behavioural trait of 
interest could receive a different importance 
weight allocated by the breeding company con¬ 
cerned, based on the housing system in which 
the pig is kept. For example, rooting and paw¬ 
ing behaviours performed by the sow when 
housed in a farrowing stall may receive less 
weighting than if the sow is to be housed in a 
more loose housing system. 


Maintenance Behaviours 
(Feeding and Drinking) 

The pig, as a species, is known for its relatively 
copious feed intake. Figs can reach feed intakes 
of 5% of their body weight, which exceeds the 
level of feed intake for most farm animal 
species (NRC, 1988). All other factors held 
constant, if we increase feed intake in young 
animals, we will not only improve the rate of 
weight gain but also do so at a greater 
feed efficiency. Thus, we clearly have great 
economic reason to increase feed intake in 
lean lines of pigs. The genetic basis for control 
of average daily feed intake (ADFI) has been 
less well studied than the closely correlated 
trait, average daily gain (ADG), but, because 
ADG and ADFI are highly correlated, selection 
would be successful if one were to select for 
feed intake alone. A complete review of the 
genetics of performance traits can be located 
in Chapter 14. 

Quite different problems of feed intake on 
commercial farms may show genetic variation 
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and thus are amenable to change. For example, 
adult sow feed intake on commercial high- 
quality diets can be too high, which wastes 
resources and, if left unchecked, causes exces¬ 
sive body fat and body size. The solution is to 
physically limit feed intake, which requires 
resources and alters sow behaviours. Another 
feed intake problem is the post-weaning feed¬ 
ing period in young pigs. Pigs do not usually 
consume much dry feed in the hours shortly 
after weaning. Gradually, over the first few 
days and weeks, feed intake increases to the 
species-typical level. A quicker start on dry feed 
would mean faster and probably more efficient 
weight gains. The problem of poor post-weaning 
gains was addressed by Akins (1989). She meas¬ 
ured feeding behaviour in the post-weaning 
period for pigs weaned at 28 days of age for 
half and full siblings; she measured feeding 
durations, number of feeding bouts and time 
from weaning to first feeding experiences. In 
addition, she calculated a slope of onset of feed 
intake with time after weaning as X and feed¬ 
ing time as Y. In this model, pigs with a larger 
slope of feeding duration over time after wean¬ 
ing would have a quicker onset of feed intake. 
All measures of post-weaning feeding behav¬ 
iours had very high heritability ( h 2 = 0.87) esti¬ 
mates that were significantly greater than zero 
(Table 9.1). In the same work, the h 2 for post- 
weaning ADG was 0.84 ± 0.14. The compari¬ 
son with post-weaning ADG suggested that the 
genetic variation for post-weaning feeding 
behaviour was similar to the genetic variation 
for ADG, although these h 2 values seem high 
compared with other research. Onset of feeding 


after weaning also showed significant h 2 for 
several measures. With standard errors of 
about 0.13, the estimates of h 2 ranged from 
0.31 to 0.89 for the onset of dry feed intake. 
These data suggest that selection for early 
onset of intake of dry feed would have as great 
a chance of success as selection for body 
weight gain. 

de Haer (1992) examined the genetics of 
feeding behaviour in group-housed growing 
pigs. Landrace males and females were com¬ 
pared with Yorkshires of both sexes for feed 
intake and feeding patterns. Landrace pigs 
spent more time at each meal; they ate fewer 
meals with larger intake per meal but had simi¬ 
lar overall feed intakes to the Yorkshire pigs. 
Von Felde et al. (1996) examined voluntary 
feed intake, feed intake pattern and perform¬ 
ance traits, which were recorded on 3188 
group-housed boars of Landrace and Large 
White tested from day 100 to day 170. 
Measurements of feed intake and feed intake 
behaviour were obtained by electronic feed dis¬ 
pensers (ACEMO) under ad libitum condi¬ 
tions. Heritabilities of feed intake were 
estimated to range from 0.16 to 0.30 over the 
trial. Hall et al. (1999) compiled 1832 indi¬ 
vidual feeding records from 70 sire families 
using a food intake recording equipment sys¬ 
tem. The authors noted that daily feed intake 
had an h 2 of 0.21 (range 0.18-0.26), but that 
feeding pattern traits were low (0.06 to 0.11), 
with the exception of feed intake per visit 
(0.27) and number of visits per day (0.34). 

When considering QTLs for feed intake 
analysis, feeding behaviour and growth traits, 


Table 9.1. Heritability estimates (on diagonal) and genetic (above diagonal) and phenotypic correlations 
(below diagonal) for selected behavioural traits in weanling pigs. Data were collected over 24 h after 
weaning and are from full-sibling analyses using Harvey’s (1987) procedures (Akins, 1989). 


Behaviour (measured in min) 


Type of behaviour 

Feeding 

Drinking 

Attack 

Submit 

FB 

Feeding 

0.87 ±0.13 

0 

0 

0 

0.25 

Drinking 

0.34* 

0.58 ±0.13 

0 

0 

0.14 

Attack 

-0.05 

0.13 

0±0 

0 

0 

Submit 

-0.09 

-0.09 

0.03 

0±0 

0 

FB 

0.25** 

0.34* 

-0.02 

0 

0.36 ±0.13 


‘Differs from zero, P < 0.01. 

“Differs from zero, P< 0.05. 

FB, number of feeding bouts during the first 3 days after weaning at 28 days of age. 
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Houston et al. (2005) examined an F 2 popula¬ 
tion derived from a cross between Chinese 
Meishan and European Large White pigs. The 
only QTL that reached genome-wide signifi¬ 
cance was located on SSC2 at 28 cM, and this 
affected daily feed intake during the 55 to 
80 kg growth period. The authors reported 
some suggestive evidence for QTLs that 
affected average daily gain (chromosomes 8, 
11 and 17), daily feed intake (chromosomes 
11, 13 and 17), feed conversion ratio (chro¬ 
mosomes 11, 12 and 14), daily feeding time 
(chromosome 6), average feed per visit (chro¬ 
mosomes 11 and 15), and average feeding 
rate (chromosomes 3 and 14), but these would 
need to be followed through in more detail. 

Recently, Reiner et al. (2009) also 
reported that feeding behaviour in pigs was 
associated with eight QTLs. Baseline feeding 
behaviour was influenced by QTLs on SSC5, 
18 and 7, and together they explained more 
than 40% of F 2 phenotypic variance. During a 
disease challenge, these effects vanished in 
favour of QTLs on SSC6, 8, 9, 16 and X. With 
one exception on SSC6, feeding behaviour 
was always positively influenced by Meishan 
alleles. The genetics of feeding behaviour 
should not be considered in isolation from the 
genetics for drinking behaviour, as pigs are 
prandial drinkers. Reiner et al. (2009) has 
begun addressing the QTLs associated with 
drinking in pigs, and reported that it is influ¬ 
enced by six QTLs on SSC5, 7, 11, 12, 15 
and 16. None of these QTLs influenced the 
drinking behaviour of healthy pigs before infec¬ 
tion. Two, three and one QTLs, respectively, 
showed effects during acute, subclinical and 
chronic sarcocystosis. With one exception (the 
QTL on SSC12), Pietrain alleles were associ¬ 
ated with reduced drinking behaviour. During 
the subclinical stage of infection (day 28), three 
QTLs explained about 65% of F 2 phenotypic 
variance of this behavioural indicator. The QTL 
on SSC7 alone explained 27% of variance. 

Zhang et al. (2009) has also identified 
QTLs for feed consumption and feeding behav¬ 
ioural traits in pigs. ADFI, feed conversion ratio 
(FCR), number of visits to the feeder per day 
(NVD) and average feeding rate (AFR) were 
recorded in 577 F 2 animals from a White 
Duroc x Chinese Erhualian resource popula¬ 
tion during the fattening period of 120 to 240 


days. A whole-genome scan was performed 
with 183 microsatellites covering the pig 
genome across the entire resource population. 
A total of eight QTLs were identified on five 
pig chromosomes, including three genome¬ 
wide significant QTLs for FCR on SSC2, 7 and 
9, one significant QTL for ADFI on SSC3, and 
one for NVD on SSC7. These QTLs were iden¬ 
tified for the first time, except for the QTL for 
FCR on SSC2. Four of the five significant QTLs 
were adjacent to the known QTLs for growth, 
carcass and fat deposition traits, supporting the 
existence of gene(s) with pleiotropic effects on 
these traits. White Duroc alleles were generally 
associated with greater phenotypic values, 
except for those on SSC7 and 9. Comparison 
of QTLs for feed consumption and feeding 
behaviours indicated that distinct chromosomes 
had effects on the two types of traits. The 
authors noted that characterization of the caus¬ 
ative gene(s) underlying the identified QTL would 
shed new light on the genetic basis of feed con¬ 
sumption and feeding behaviours in pigs. 


Sexual Behaviours 

Sexual behaviours include courtship behaviours 
by both sexes, male and female reproductive 
techniques and female expression of oestrus. 
Each of these categories of reproductive behav¬ 
iour is possibly under genetic control in part or 
whole. Goy and Jakway (1969) espoused the 
merits of gaining a better understanding of the 
genetic basis of sexual behaviours, but little 
direct work has been reported in pigs. Even 
reviews (for example, Bichard and David, 1985) 
of the genetics of prolificacy have not appreci¬ 
ated the role of behaviour in its significant con¬ 
tribution to prolificacy. An excellent review by 
Flowers (2008) discusses the current under¬ 
standing of phenotypic variation in reproduc¬ 
tive traits of Al (artificial insemination) boars. 
The proportion of boars that cannot be trained 
for collection in commercial studs is low, and 
differences among genetic lines are small. In 
contrast, there is a considerable variation in 
sperm production and significant differences 
are present among genotypes. In general, sex¬ 
ual behaviours in boars have not been studied 
to the same extent as other reproductive traits. 
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The majority of the published work performed 
in the late 1970s concluded that crossbred 
boars exhibited greater libido than their pure¬ 
bred counterparts, and that boars with Duroc in 
their pedigrees were less interested in mounting 
and more reluctant to mate than males with 
Yorkshire breeding. We do know that certain 
breeds of pigs express oestrus, including oestrus 
behaviour, at a very young age. The Meishan 
breed of pigs from China shows much earlier 
oestrus onset than traditional European breeds 
of pigs. Reasons for this are not known for the 
most part. Onset of oestrus after weaning seems 
to have a genetic component, ten Napel et al. 
(1995) reported realized heritabilities of 0.17 to 
0.36 ± 0.05 for weaning to oestrus interval. 
This trait might be amenable to selection or the 
identification of the major genes involved. The 
genetic mechanisms of sexual techniques such 
as mounting, mounting efficiency, coitus, lordo¬ 
sis and others, although generally thought to be 
innate in part, are not well understood. Male 
libido, at least, is an economically important 
trait, probably closely tied to gamete produc¬ 
tion. Male libido is another area that should be 
investigated for genetic variation. A complete 
review of the biology and genetics of reproduc¬ 
tion can be found in Chapter 10. 


Maternal Behaviours 

A preferred measurable trait utilized in most 
breeding programmes is the number of piglets 
born or born alive per litter. Selection for an 
increased litter size at birth does not necessarily 
translate into a direct increase in the number of 
weaned pigs if the capacity of the sow to gestate 
and then take care of her litter is a limiting factor 
(Hogberg and Rydhmer, 2000). Knol et al. 
(2002) noted that essentially nothing is known at 
present about the biological background of differ¬ 
ences in genetic merit for maternal effects and 
mothering ability. Yet the sow-piglet reciprocal 
relationship through their performed behaviours 
is a critical aspect for successful swine production 
as it contributes to pre-weaning mortality and 
growth of the piglets (Simm et al., 1996). 

The sow is an unusual ungulate, not only 
because she is polytocous and builds a nest, but 
also because licking of the fetal membranes of 


the newborn is rare. The genetic merit of esti¬ 
mated breeding value of mothering ability 
(EBV ma ) can be calculated as the maternal 
genetic effect of the foster sow on piglet sur¬ 
vival until weaning and expressed as a devia¬ 
tion from the population herd, or even litter 
mean (Uitdehaag et a/., 2008). Knol et al. 
(2002) concluded that the sow’s EBV ma rather 
than the piglet’s genetic merit for vitality is 
related to the litter average time interval 
between piglet expulsion and first colostrum 
intake. Piglets born from a group of sows with 
a relatively high EBV ma had a shorter mean 
interval from birth until first colostrum intake 
than piglets born from sows with a lower EBV ma 
(40 min versus 100 min). Furthermore, these 
authors noted that the interval from birth until 
first colostrum intake between sows with differ¬ 
ent EBV ma values was not related to differences 

Ilia 

in udder and teat morphology. 

Several workers have studied the effects of 
sow rolling (lateral to sternum lying) on piglet 
mortality (Weary et a/., 1996a,b; Marchant et a/., 
2001). Uitdehaag et al. (2008) hypothesized 
that, during parturition, sows with a higher 
EBV ma would spend more time lying laterally, 
spend less time standing and have fewer postural 
transitions than sows with a low EBV ma . These 

Ilia 

authors found that during parturition sows with a 
high weaning survival spent more time sitting 
and less time standing than sows with a low 
weaning survival, indicating that maternal behav¬ 
iour during parturition has an effect on piglet 
survival at weaning; however, no effect of EBV ma 
on maternal behaviours data collected during 
parturition was observed in this study. 

Optimal sow maternal behaviour after par¬ 
turition is a critical component of piglet survival 
and sow longevity within the breeding herd. It is 
likely that natural selection in the wild boar 
favours a willingness to protect piglets from 
intruders and predators, as suggested by the¬ 
ories that emphasize the adaptive functions of 
aggression- and fear-related defensive behav¬ 
iours (Grandinson et al., 2003). Although the 
original function for such behaviour is piglet 
defence, it can be a practical problem when the 
behaviour is directed either at the stockperson 
or at the piglets themselves. 

The most dramatic form of maternal 
aggression is cannibalism. This involves a sow 
biting, killing and eating newborn piglets. 
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Three behaviour subtypes, varying in degrees 
of expression, can be observed for cannibalistic 
sows. One form of cannibalism occurs when 
the sow is hyper-reactive following piglet birth 
and responds with agitation to piglet activities, 
including their vocalizations. Piglets killed in 
this situation may be wholly or partially eaten. 
The second form of cannibalism resembles 
neonatal rejection. The sow persistently avoids 
her piglets and this avoidance behaviour can 
lead to aggression directed towards piglets 
when they approach her. The third form of 
cannibalism in the sow resembles the general 
puerperal aggression condition. This resembles 
the mental illness sometimes occurring after 
childbirth in humans. The affected sow is 
hyperactive following parturition and shows 
aggression towards people or piglets that come 
within her range, and snaps aggressively at any 
intruding piglets. The aggressive piglet biting 
usually leads to the death of the entire litter. As 
with the previous forms of cannibalism, piglets 
killed by biting may be partially or wholly eaten, 
or left. All of these cannibalism forms expressed 
by sows are associated with hyper-excitability 
and are limited primarily to sows (gilts) in their 
first parity, although the behaviour can be 
observed in older experienced sows as well. 
Once cannibalism has started, the behaviour is 
likely to continue until the litter is lost entirely. 

Two recent QTL studies have begun 
addressing maternal infanticide in sows. An 
affected sib-pair whole-genome linkage analysis 
was carried out with 80 microsatellite markers 
covering the 18 porcine autosomes and the X 
chromosome, with the aim of identifying chro¬ 
mosomal regions responsible for this abnormal 
behaviour (Quilter et al., 2007). The results 
identified four QTLs mapping on SSC2, 10 
and X (SSCX). Chen et al. (2009) studied 
maternal behaviours from 5h before to 24 h 
after parturition in 288 White Duroc x Erhualian 
intercross F 2 sows over three continuous farrow- 
ings. In the F 2 population, 12.8% gilts showed 
maternal infanticide in their first litter, while the 
incidences of maternal infanticide at their sec¬ 
ond and third farrowing reduced to 7.5% and 
4.5%, respectively. All F 2 sows were genotyped 
for 194 microsatellite markers spanning the 
whole pig genome. The authors identified that 
seven chromosome regions on SSC2, 6, 14, 15 
and X were significantly linked with maternal 


infanticide (P < 0.05). The QTLs on SSC2 and 
SSCX achieved P < 0.01. Chen et al. (2009) 
noted that the most promising QTLs, however, 
were detected on the X chromosome, where 
there were three peaks of negative logarithm 
P-values located at markers SW980, SW2456 
and SW1608. 

Protectiveness by the sow towards her 
piglets has been measured by how the sow 
reacts to fear and reaction to piglet screaming. 
Vangen et al. (2005) estimated heritabilities in 
a Norwegian herd for several maternal traits 
that could affect piglet survival. Heritability 
estimates were 0.00 for carelessness against 
piglets, 0.12 for reaction to piglets screaming, 
0.17 for fear during routine management and 
0.22 for protests by the sow when moved to 
the farrowing pen. Several authors have 
reported the importance of h 2 for maternal 
behaviours over lactation that may influence 
sow productivity and, in turn, longevity within 
the breeding herd. Grandinson et al. (2002) 
estimated the phenotypic and genetic relation¬ 
ships between the sow’s reaction to a piglet 
screaming and early piglet survival/growth. 
Twenty per cent of the sows did nothing, 47% 
reacted by looking for the sound, 12% sat up 
and 21% stood up. In addition, the females 
exhibiting the greatest response to the scream 
test included gilts and parity five and older 
sows. Gade et al. (2008) collected data from 
32 nucleus and multiplier herds in Germany to 
estimate variance components and breeding 
values for five maternal behaviour traits in 
sows. A single-trait analysis was implemented 
to estimate these values using a threshold ani¬ 
mal model. A total of 31,000 farrowings 
recorded from December 2003 until July 2005 
were included. The h 2 estimates were 0.07 
(se 0.06) for group behaviour, 0.06 (se 0.03) 
for attitude to people and 0.05 (se 0.01) for 
maternal ability. Additionally, genetic correla¬ 
tions between the behaviour traits themselves, 
and between the behaviour traits and litter size, 
respectively, were estimated using restricted 
maximum likelihood (REML) procedures in 
which a linear model was implemented in the 
multivariate analysis. Low heritability and weak 
genetic correlations with litter size at birth 
indicate that it may be difficult to genetically 
improve maternal behaviour, and that selection 
for better mothering ability is not necessarily 
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accompanied by changing litter size at birth. In 
another study, Grandinson et al. (2003) 
recorded four behaviour traits that may play a 
role in maternal ability, and calculated h 2 esti¬ 
mates for the sow that included: (i) a sow’s 
response to the sound of a screaming piglet; (ii) 
a sow’s response towards the stockperson han¬ 
dling her piglets; (iii) avoidance of; and (iv) 
aggression towards the stockperson during the 
handling of the piglet (Table 9.2). The results of 
this work indicate that selection for a strong 
response in the scream test or selection against 
sows that avoid humans would result in a cor¬ 
related genetic improvement in piglet survival. 
Avoidance may reflect underlying fear, and 
selection for lower levels of fear in sows would 
improve both sow and piglet well-being. 

The complex phenomenon of pre-weaning 
mortality has been well researched, with 30% of 
the losses attributable to a single factor, for exam¬ 
ple a piglet being trapped by the sow, and 70% 
to multiple factors, for example a piglet missing a 
meal, becoming lethargic and then being trapped 
by the sow (Le Dividich et a/., 1996), but the 
reasons why sows crush and kill their young are 
as yet unexplained. Ethological theories relate to 
the sow farrowing so many young that some are 
dispensable. Other theories reflect that the envi¬ 
ronment or nutrient requirements are not met, 
or that predators are close, which can increase 
sow excitability (van der Steen et al., 1988). 
Crushed piglets not only represent one of the 
most important factors limiting sow productivity, 
but also present a substantial obstacle in any 
attempt to improve piglet well-being (Arey, 
1993) (Table 9.2). These undesired aspects in 


the domestic sow’s maternal behaviour may have 
arisen during the domestication, especially dur¬ 
ing the intense selection breeding regimes for 
economically important traits that have been 
prevalent in the past few decades. 

Spinka et al. (2000) conducted a study 
comparing domestic (Yorkshire sire x Dutch 
Landrace mothers) and wild (sire) x domestic 
crossbred sows and asked: (i) ‘did maternal 
behaviour change during domestication?’; and 
(ii) ‘can the inter-individual variability of mater¬ 
nal behaviour be subsumed into a few dimen¬ 
sions of maternal temperament?’. In this study, 
the crossbred wild x domestic sows did not differ 
from the domestic sows in any aspect of mater¬ 
nal behaviour except for a higher tendency to 
terminate nursing and a higher frequency of 
changing posture from lying to standing and 
back during the night. Factor analysis (based on 
a correlation matrix of 11 behaviours and corti¬ 
sol variables calculated for all sows after remov¬ 
ing the effect of breed) indicated that 82% of the 
variability in the data could be explained by 
three factors: (i) ‘calmness’, of which sow night¬ 
time frequency of major posture changes, care¬ 
fulness of laying-sow behaviour and a high 
propensity to remain in the nursing position 
after nursing is completed loaded positively; (ii) 
‘protectiveness’, which had high loadings for 
reaction scores to the playbacks of piglet dis¬ 
tress calls and human presence near the piglets; 
and (iii) ‘nursing activity’, which was strongly 
positively associated with nursing frequency. 
The authors concluded that most pig maternal 
behaviour aspects have not been significantly 
changed by domestication, and that substantial 


Table 9.2. Summary of heritability estimates from different studies for key maternal behaviours of the 
sow towards her piglets during farrowing and lactation. 


Author 

Trait 

Heritability 

Knap and Merks (1987) 

Aggressive behaviour of sow to piglets 

0.7-0.9 

van der Steen et al. (1988) 

Aggressive behaviour of sow to piglets 

CO 

■ 

o 

1 

■ 

o 

Hen (1996) 

Aggressive behaviour of sow to piglets 

0.17 

Grandinson et al. (2002) 

Scream test 

0.04 


Crushing 

0.07 

Grandinson et al. (2003) 

Avoidance 

0.08 


Handling test 

0.01 


Scream test 

0.06 

Gade et al. (2008) 

Aggressive behaviour of sow to piglets 

0.02 


Crushing 

0.03 

Hellbrugge et al. (2008) 

Crushing 

0.03 
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maternal behaviour variability exists between 
sows, perhaps in the form of several behaviour 
characteristics that encompass both behaviour 
and endocrine profiles of the sows. 

Although the sow can be the main cause of 
pre-weaning mortality, either directly through 
crushing (Marchant et a/., 2000) or indirectly via 
starvation (Algers, 1994; Weary et a/., 1996a), 
the piglets’ behaviours and interactions with their 
dam can also contribute to their survival to wean¬ 
ing success. In populations with reliable registra¬ 
tion of farrowing survival and pre-weaning 
survival, genetic variation can be quantified by 
the breeding value estimates for pig survival 
(EBV ps ). Estimated breeding value for pig survival 
for an individual pig represents its genetic merit 
to survive from parturition until weaning; values 
for individual pigs may, among other factors, be 
related to birth weight (Roehe et a/., 2009), 
farrowing progression (Leenhouwers et a/., 
2001) and the newborn piglet early postnatal 
behaviour (Fraser et a/., 1997). Leenhouwers 
et a/. (2001) reported that farrowing survival (P 
= 0.007) and early postnatal survival (P = 0.027) 
both increased with increasing EBV ps . Birth 
intervals tended (P = 0.10) to increase with 
increasing EBV ps , and duration of farrowing was 
not related to EB\/ . Time until first teat in mouth 

ps 

increased with increasing EBV ps (P = 0.05), 
but the other piglet vitality behavioural indicators 
(first upright standing, time to first contact with 
udder and time to first colostrum intake) were 
not related to EBV ps . The authors noted that 
information on these survival-related factors in 
piglets with known EBV ps values will give 
insight into the biological background of genetic 
differences in piglet survival, and will contribute 
to an improved selection and management 
strategy. 


Stress-related Alarm, Fear and Other 

Emotional Behaviours 

From stress to the psychobiology of 

adaptation 

Responses to environmental challenges involve 
different types of mechanisms: psychological 
(emotional and cognitive), biological (nervous 
and neuroendocrine) and behavioural (fight 


and flight). Each of these includes a set of 
specific changes directly related to the stimulus 
as well as non-specific adaptations, usually 
referred to as stress responses. A great degree 
of variability exists among individuals of the 
same species (Mormede et a/., 2002). In gen¬ 
eral, the concept of stress derives from the 
study of the physiological adjustments neces¬ 
sary to maintain homeostasis in a fluctuating 
environment. Walter B. Cannon (1935) noticed 
that the specificity of the sympathetic nervous 
system response usually seen with most stimul¬ 
ation was lost when the stimulus intensity over¬ 
flowed the normal regulatory mechanisms, or 
in the case of an intense emotional activation, 
such as the case of a cat exposed to a barking 
dog. The cardiovascular and metabolic actions 
of the sympathetic nervous system were con¬ 
sidered as necessary adjustments allowing an 
efficient behavioural adaptation, the ‘fight or 
flight’ response. Selye (1936) described the 
activation of the adrenocortical gland, in which 
glucocorticoid hormones were released under 
the control of the anterior pituitary gland and 
of the paraventricular nucleus of the hypothal¬ 
amus (the hypothalamic-pituitary-adrenal 
(HPA) axis), as a non-specific response to a 
number of stimulations. Mason (1971) demon¬ 
strated that this non-specificity response was 
primarily the result of emotional activation by 
the environmental stimulus. Thus, stress 
research moved from pure physiology to psy¬ 
chobiology, with physiological responses being 
intimately interconnected to behavioural 
adjustments. Both response outputs are the 
coordinated expression of a central emotional 
state induced by environmental stimulations. 
However, these responses are not stereotyped, 
but depend upon the specific individual reactiv¬ 
ity pattern and the behavioural control efficiency 
over the stimulus (Dantzer and Mormede, 
1983). For an excellent review of molecular 
genetic approaches to investigating individual 
variations in behavioural and neuroendocrine 
stress see Mormede et al. (2002). 

Genetics of emotional behaviours 

Behavioural scientists have devised different tests 
to determine individual behavioural reactivity 
features in experimental settings. These include 
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the open field test, the human approach test, the 
tonic immobility test, the elevated plus maze test 
and the emergence test (Anderson et a/., 2000). 
Numerous behavioural measures can be col¬ 
lected on each animal while undergoing these 
tests; for example, ambulation score, explora¬ 
tion time, vocalizations, faecal boluses and urina¬ 
tions are the classical measures of the emotional 
output, and involve both behavioural and auto¬ 
nomic responses. The tests can be further com¬ 
plicated; for example, if animals are placed into 
the open field test in the presence of food in the 
arena, to create a motivational conflict (neopho¬ 
bia test, Mormede et a/., 1984), or by adding 
new objects during the test (Lawrence et a/., 
1991; Jensen et a/., 1995a,b). Behavioural 
measures can be complemented by collecting 
physiological measures; for example, the adreno¬ 
cortical axis response to such challenges can be 
measured by comparing plasma levels of adreno- 
corticotrophic hormone (ACTH) and cortisol 
before and after the test. 

Several divergent selection experiments 
involving laboratory rodents to measure emo¬ 
tionality have taken place, and rodents that 
display a low motor activity and a high elimi¬ 
nation are considered more emotional (Hall, 
1934; Archer, 1973). Furthermore, these 
emotions have a genetic component 
(Broadhurst, 1962; DeFries et a/., 1966). 
However, little work has been done to deter¬ 
mine emotionality and the link to behavioural 
genetics in pigs. Von Borell and Lade wig 
(1992) reported that the open-field score vari¬ 
ability was larger between litters than within 
litters. Comparison among different breeds 
demonstrated considerable variation (Mormede 
et a/., 1984). Mormede et a/. (1994) studied 
F 2 intercrosses between Large White sires 
(with a high score of activity in the open-field 
test) and Meishan dams (with a low score in 
the test) in a novel environment and found that 
the character was normally distributed, sug¬ 
gesting that the trait is under multigenic con¬ 
trol. Moreover, Large White pigs displayed 
higher defecation and locomotion scores than 
Meishan pigs, and crossbred animals were 
intermediate for locomotion scores, suggest¬ 
ing that there were no dominance effects for 
this behavioural trait. For defecation scores, 
Meishan pigs were behaviourally dominant 
over Large Whites. The h 2 estimate for the 


locomotion score, calculated as four times the 
paternal half-sib correlations, was approxi¬ 
mately 0.16. In another study, Barnett et al. 
(1988) showed that genetic selection for 
growth performance resulted in an increase in 
activity, but no change in exploratory activity, 
in a novel environment, together with a mod¬ 
erately increased feeding time in this novel 
environment, and a reduced motivation to 
interact socially. 

Shea-Moore (1998) utilized the open-field 
test for segregated early-weaned pigs and two 
levels of body leanness. One group was selected 
for high levels of lean gain (Hi-L) and the other 
was selected for low levels of lean gain (Lo-L). 
Pigs were tested for 5 min, and vocalization, 
defecation and activity level were recorded. 
Salivary cortisol samples were collected imme¬ 
diately after the open-field test and also at 15, 
30 and 45 min after the behavioural test. The 
results indicated higher levels of activity in the 
Lo-L treatment group than in the Hi-L group 
(P < 0.05). However, there was large variation 
in the number of vocalizations regardless of 
treatment and, because of this variability, no 
treatment differences were detected. Higher 
levels of activity usually suggest less anxiety 
and more interest in exploration of the envir¬ 
onment, yet the level of defecation indicated 
an increase in anxiety in a novel situation. 
Although baseline cortisol levels were higher in 
the Lo-L group than in the Hi-L group (P < 
0.05), over time there were no treatment dif¬ 
ferences. The author noted that pigs selected 
for a lower lean-gain showed a higher activity 
level in the open-field test, and that it may be 
possible that, by selecting for high lean gain, 
an animal’s ability to cope with a novel situa¬ 
tion is changed, thus affecting the well-being of 
the animal. A more recent study by Fabrega 
et al. (2004) investigated the effect of the 
RYR1 (known as the halothane gene, and 
associated with stress susceptibility, which can 
trigger malignant hyperthermia) genotype on 
the open-field behaviour in growing pigs. The 
study subjected 15 heterozygous (Nn) and 15 
PYP1-free (NN) gilts of 19 weeks old to three 
replicates of an open-field test 2 days apart 
from each other. The study measured the 
number of grid lines crossed and the defecation 
score in the test arena. There was a significant 
individual correlation among the three replicates 
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of the test, both for the number of grid lines 
crossed and the defecation score (P < 0.05). 
The RYR1 genotype had a significant effect on 
the number of grid lines crossed, with NN gilts 
showing more overall activity than Nn gilts (P < 
0.05). In this work, no significant differences in 
defecation score occurred between the geno¬ 
types, and the authors concluded that the RYR1 
genotype may have an effect on the appraisal 
of novelty. 

The social behaviours of pigs towards 
humans show large individual differences 
(Lawrence et a/., 1991), and might be under 
genetic control. Hemsworth et al. (1990) 
measured pig fear in a human approach test. 
Pigs that approached more quickly were said 
to have less fear of humans, and the h 2 esti¬ 
mate for ‘fear of humans’ was 0.38 ± 0.19. 
Tonic immobility (the back test) data can be col¬ 
lected by placing each pig on its back, restrain¬ 
ing it in a supine position for a defined period 
of time, and counting the number of escape 
attempts and/or vocalizations. Hessing et al. 
(1993, 1994) described the back-test score dis¬ 
tribution as bimodal, suggesting that two dis¬ 
tinct phenotypes (active versus passive) exist. 
However, Forkman et al. (1995) could not 
confirm a bimodal distribution, and several 
other groups have reported a unimodal distri¬ 
bution of reactivity traits in the population 
(Lawrence et a/., 1991; Mormede et a/., 
1994). Forkman et al. (1995) compared piglet 
behaviour using social (rank order, social 
dependence, aggression) and non-social (nov¬ 
elty, extinction, back test) tests, and used prin¬ 
cipal component analysis to study the 
relationships between these different meas¬ 
ures. This analysis suggested three independ¬ 
ent personality traits: aggression, sociability 
and exploration. This approach is quite recent 
in pigs, but confirms the multidimensional fea¬ 
tures of behavioural response to social and 
non-social challenges that have been demon¬ 
strated in laboratory rodents and other species, 
including pigs. 

Velie et al. (2009) recently estimated 
repeatabilities and heritabilities for measures of 
pig behaviour and their relationship with per¬ 
formance. Measures of behaviour and perform¬ 
ance included the back test, resident-intruder 
test, human approach test (HAT), novel object 
test (NOT), day 1 body weight (BW), backfat 


WA 


depth (BF), loin muscle area (LMA), ADG in the 
farrowing house, ADG, 21-day BW and 140- 
day BW (or W). Each behavioural trait was 
measured twice. The study consisted of 95 
litters from 31 sires with an average of three 
litters per sire (n > 457). Between 7 and 14 
days old, the back test was conducted by plac¬ 
ing each pig in a supine position for 60s. Total 
time spent struggling (TTS) and total number of 
attempts to struggle (TAS) were recorded. The 
resident-intruder test involved two nursery pigs, 
a resident pig and an unfamiliar intruder pig. 
The resident pen was divided in half by a solid 
partition. A resident pig was placed in the test 
area, and an intruder pig was then introduced. 
Latency until an attack occurred (LAT) and total 
number of attacks over two tests (RIS) were 
recorded. The amount of time taken for each 
finishing pig to make snout contact with an 
unfamiliar human or object was recorded. Dam 
and sire effects influenced all traits (P < 0.01). 
Sex and pen affected LAT, RIS, HAT and NOT 
(P < 0.10). Repeatabilities of TTS, TAS, RIS, 
LAT, HAT and NOT were 0.38, 0.21, 0.07, 
0.08, 0.17 and 0.11, respectively. The pheno¬ 
typic correlations of TTS with TAS and HAT 
with NOT were 0.61 and 0.34, respectively. 
Phenotypic correlation between RIS and LAT 
was -0.85. TTS and TAS tended to be pheno- 
typically correlated with 21-day BW and ADG 
in the farrowing house. TAS was phenotypi- 
cally correlated with BF (0.15). LAT was 
phenotypically correlated with LMA (0.23). 
Resident intruder score was phenotypically cor¬ 
related with ADG (-0.13), W (-0.13) and LMA 
(-0.21) and estimated to be slightly heritable 
(i h 2 = 0.12). The heritabilities of TTS and TAS 
were 0.31 and 0.53, respectively. The genetic 
correlation of TAS with ADG and W was 0.38. 
Genetic correlations of TTS with BF, W and 
TAS were 0.14, 0.18 and 0.81, respectively. 


Genetics and neuroendocrine emotional 

responses 

In pigs, most available data deal with the HPA 
axis, although large breed/strain variation has 
been reported for circulating catecholamines in 
pigs (Mormede et al., 1984), as well as in labo¬ 
ratory rodents (McCarty and Kopin, 1978). 
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Individual differences in circulating cortisol 
levels were shown by Hennessy et al. (1988) 
to be related to changes in the adrenal 
response to ACTH (Hennessy et a/., 1988; 
Zhang et al., 1990, 1992), a result reported 
in humans as well (Bertagna et al., 1994). 
Indeed, the cortisol response to ACTH 
shows a very large range of inter-individual 
variation, but is a stable trait in individuals (in 
pigs, Von Borell and Ladewig, 1992; in 
humans, Bertagna et al., 1994). The cortisol 
response to ACTH can be influenced by 
chronic environmental stress, such as tight 
restraint (Janssens et al., 1994, 1995), and by 
genetic factors, as clearly demonstrated by sev¬ 
eral divergent selection experiments in poultry 
(Brown and Nestor, 1973; Edens and Siegel, 
1975; Satterlee and Johnson, 1988). Large 
differences in circulating cortisol levels between 
European and Chinese pigs have been 
described (Mormede et al., 1984), and evi¬ 
dence exists for genetic control of adrenocorti¬ 
cal activity from a study involving an F 2 
population from a cross between Large White 
and Meishan pigs (Mormede et al., 1994). 
Desautes et al. (1999) reported that the differ¬ 
ence originated from the adrenal gland, and 
that it was independent of the ACTH drive. 
Taken together, these experiments show that 
multiple mechanisms may explain the genetic 
control of adrenocortical function. 

Several experiments have demonstrated a 
link between HPA axis activity and production 
traits in domestic livestock species. Hennessy 
and Jackson (1987) and Barnett et al. (1988) 
showed that selection for growth performance 
was followed by reduced plasma cortisol levels 
and maximum corticosteroid binding capacity 
in pigs. Behavioural and neuroendocrine 
response modulation of stress in pigs as a result 
of genetic factors is now well established. Tools 
to investigate the molecular basis of genetic 
influences on the different stress responses 
are available, as demonstrated in mice by Flint 
et al. (1995). This approach of locating behav¬ 
ioural loci has been possible in the pig since 
the advent of pig genome maps (Archibald, 
1994, 1995; Rohrer et al., 1994) and the 

more recent announcement of the first draft 
sequence of the pig genome. 

Recent investigations have begun concen¬ 
trating on the genetic relationships between 


different emotional reactivity traits and eco¬ 
nomically important production traits. Desautes 
et al. (2002) conducted a QTL analysis of 
behavioural and neuroendocrine responses to 
a novel environmental stress in a three-genera¬ 
tion experimental cross between Meishan and 
Large White pig breeds. Locomotion, vocaliza¬ 
tion and defecation rate, as well as exploration 
time, were measured for 10 min. Blood sam¬ 
ples were taken immediately before and after 
the test to measure plasma levels of ACTH, 
cortisol and glucose. Animals were typed for a 
total of 137 markers covering the entire por¬ 
cine genome. The authors reported a highly 
significant gene effect for post-stress cortisol 
level (P < 0.001) and a significant effect for 
basal cortisol level (P < 0.05) at the end of the 
q arm of chromosome 7, which they noted as 
explaining 20% and 7% of the phenotypic 
variance. Meishan alleles were associated 
with higher cortisol levels and were partially 
dominant (for post-stress levels) over Large 
White alleles. Other significant gene effects on 
biological measures were detected on chromo¬ 
somes 1 and 17 (ACTH response to stress) 
and 3, 5 and 8 (glucose levels). An interesting 
observation by Hutson et al. (2000), when 
comparing the responses of individual growing 
pigs to 60 stimuli from five sensory categories, 
was that significant variation occurred between 
individual pigs and that future studies of emo¬ 
tion, stress and reactivity should use stimuli 
that elicit high responsiveness but have little 
variation. 


Aggressive Behaviours 

Aggressiveness in pigs shows wide phenotypic 
variation and if it could be reduced could enhance 
health, well-being and growth performance 
(Bergsma et al., 2008). Excellent work has been 
conducted using the laying hen as a model to 
reduce aggressiveness (feather pecking) through 
selection programmes based on their behav¬ 
ioural repertoire (Muir, 1996). A recent study by 
Labouriau et al. (2009) used a high feather peck 
(FP) selection line, which has been selected for 
eight generations, and the authors presented 
data showing that the gene transcription profile 
of the birds performing high FP differs from the 
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profile of the other birds performing FP (456 
genes differentially expressed from a total of 
14,077 investigated). In pigs, three recent studies 
have begun investigating behavioural genetics as 
it relates to aggression. 

Turner et al. (2006) determined the 
genetic contribution to aggressiveness in pigs 
by determining the relationship with skin 
lesions. A sample of 1132 pigs was mixed at 
an average weight of 27.9kg into 96 pens on 
a commercial sire line nucleus unit. Post-mixing 
aggressiveness of pigs was assessed and an h 2 
of 0.22 was estimated for the lesion score (LS) 
trait. The response to selection, when all selec¬ 
tion pressure was placed on the LS trait, was a 
25% reduction in LS per generation. Further 
work by Turner et al. (2008) used a Bayesian 
approach to estimate the heritability of three 
traits associated with aggressiveness in pigs 
during the 24 h post mixing: duration of recip¬ 
rocal aggression, and whether in receipt of or 
delivery of non-reciprocal aggression (NRA). In 
this study, the genetic correlations were quanti¬ 
fied between the behavioural traits and skin 
lesions (Table 9.3). The authors concluded 
that, based on the estimated genetic parame¬ 
ters, the selection of breeding values for 
reduced LS (especially LS for the central region 
of the body) is expected to reduce reciprocal 
aggression and the delivery of NRA, but will 
not change the receipt of NRA directly. In a 
follow-up study, Turner et al. (2009) estimated 


the genetic correlations between skin lesions 
and aggressive behaviour post mixing and 
under more stable social conditions as a poten¬ 
tial means of selecting against pig aggressive¬ 
ness. Aggressive behaviour was recorded 
continuously for 24h after mixing, and a count 
of skin lesions (lesion count, LC) was recorded 
at 24 h and 3 weeks post mixing on 1663 pigs. 
Two behavioural traits were found to have a 
moderate to high h 2 similar to that of growth 
traits, whereas receipt of NRA had a lower h 2 
(Table 9.3). Genetic correlations (r g ) suggested 
that lesions to the anterior region of the body 
24 h after mixing were associated with recipro¬ 
cal fighting (r = 0.67 ± 0.04), receipt of NRA 
(r g = 0.70 ± 0.11) and, to a lesser extent, deliv¬ 
ery of NRA (r g = 0.31 ± 0.06); lesions to the 
central and rear regions were primarily geneti¬ 
cally associated with receipt of NRA (r g = 0.80 
± 0.05, 0.79 ± 0.05, respectively). Genetic 
correlations indicated that pigs that engaged in 
reciprocal fighting delivered NRA to other 
animals but were less likely to receive NRA 
themselves (Table 9.3). Turner et al. (2009) 
noted that a genetic merit index using lesions to 
the anterior region as one trait and those to the 
centre or rear, or both, as a second trait should 
allow selection against animals involved in recip¬ 
rocal fighting and the delivery of NRA. Positive 
correlations between LC 24 h and 3 weeks after 
mixing were found, especially for lesions to the 
central and rear regions of the body, indicating 


Table 9.3. Heritabilities (on diagonal) and genetic (above diagonal) and residual correlations (below 
diagonal) between aggressive behavioural traits 24 h post mixing. Adapted from Turner et al., 2008, 


2009. 


Behavioural traits 



Reciprocal aggression 3 

Delivery of NRA b 

Receipt of NRA 

Reciprocal aggression 

2008 study 

0.46 

0.79 

-0.16 

2009 study 

0.43 

0.84 

-0.04 

Delivery of NRA 

2008 study 

0.15 

0.37 

0.16 

2009 study 

0.39 

0.31 

-0.41 

Receipt of NRA 

2008 study 

0.62 

-0.23 

0.17 

2009 study 

0.31 

0.23 

0.08 


Reciprocal aggression defined as reciprocal damaging fighting lasting >1 s. 

b Non-reciprocal aggression (NRA) defined as rest during a fight, withdrawal at the end of a fight and attack not 
associated with a fight. 
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that post-mixing lesions are predictive of those 
received under more stable group conditions. 
Therefore, these selection indexes could not 
only reduce immediate aggression at mixing, 
but also translate into a long-term impact on 
injuries from aggression, even after dominance 
relationships are established. 

Summary 

Swine domestication has altered aspects of the 
pig’s behaviour through genetic selection and 


the control of its environment. Over the past 
5 years, work identifying QTLs for pig behav¬ 
iour has become available, in particular for 
maintenance-related behaviours. We encour¬ 
age more information on the identification of 
QTLs that might influence maternal, social, 
emotional and aggressive behaviours. In par¬ 
ticular, the importance of QTL identification 
pertaining to social and aggressive behaviours 
will become paramount if legislation in the 
USA continues to ban gestation stalls and 
sows are placed into a group-housed gestation 
system. 
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Introduction 

Reproduction is an essential process for the 
maintenance of a species. It has to be geneti¬ 
cally controlled to ensure that the reproductive 
process is repeated with a strong degree of pre¬ 
cision. Yet a considerable genetic variability 
exists between breeds for both male and female 
reproductive traits. For instance, average litter 
size of mature sows varies from four to 16 piglets 
per litter among breeds. These differences, 
combined with the appreciable genetic varia¬ 
tion that also exists within breeds, have given 
the opportunity for substantial genetic improve¬ 
ment of sow prolificacy over the last 15 years 
(up to 30% of the mean in some breeds - see, 
e.g. Tribout et a/., 2003). The main conse¬ 
quence of increased litter size has undoubtedly 
been large gains in the efficiency of pig produc¬ 
tion systems, but it has also resulted in some 
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adverse effects, such as weaker oestrus symp¬ 
toms (Rydhmer, 2000) or lower piglet survival 
(Tribout et a/., 2003; Canario et a/., 2007a). 
Such unfavourable correlative trends not only 
reduce the positive short-term effects of 
selection on efficiency, but may also have long¬ 
term consequences, such as decreased fertility 
owing to an increased proportion of undetect¬ 
able oestrus or, for piglet mortality, may raise 
ethical problems. This emphasizes the import¬ 
ance of a broad perspective of the conse¬ 
quences of genetic improvement, which 
generates new questions, such as the societal or 
ethical consequences of selection, but it also 
strengthens the need for a good knowledge of 
the genetic (co)variation of a large number of 
potentially important traits. After a quick over¬ 
view of pig reproductive biology, current know¬ 
ledge on the genetic control of reproductive 
traits in pigs and their relationships with other 

©CAB International 2011. The Genetics of the Pig, 
2nd Edn (eds M.F. Rothschild and A. Ruvinsky) 



Biology and Genetics of Reproduction 


potentially important traits is reviewed. Both 
classical parameters characterizing among- 
breed (breed differences and heterosis effects) 
and within-breed (heritabilities and genetic cor¬ 
relations) genetic variability and new informa¬ 
tion on individual genes or genomic regions 
(quantitative trait loci, or QTLs) influencing 
reproductive traits are considered. 


Some Aspects of Pig Reproductive 

Biology 

Female 

Puberty in gilts, usually defined as the moment 
of first ovulation, occurs at 3-4 months of age 
in the earliest maturing breeds (Chinese) and at 
an average of 6-7 months of age in the most 
widely used Western breeds. It generally coin¬ 
cides with the first oestrus, though ovulation 
without external manifestation of oestrus (silent 
heat) occurs occasionally in pigs, and generates 
a steroid-secreting activity of the corpora lutea. 
Age at puberty is quite tedious to measure and 
is often replaced in field studies by age at first 
mating or at first farrowing. Although strongly 
correlated, the two traits differ owing to the 
large variability in management strategies 
between breeders (Le Cozier et a/., 1998; 
Rydhmer, 2000). Ovulations then occur every 3 
weeks during the second half of a 2-3 day 
oestrous period in the absence of gestation, and 
have a mean duration of 2-5h (Soede et a/., 
1992). The oestrous cycle is controlled by 
gonadotrophic hormones. Follicle stimulating 
hormone (FSH) stimulates recruitment and 
development of ovarian follicles. Ovulation and 
corpora lutea formation are stimulated by lutein¬ 
izing hormone (LH). Ovulation rate increases 
with oestrus and parity number until the fourth 
or fifth parity. Conception rate in the pig is high 
(80-90%) and has increased with the generali¬ 
zation of multiple matings (two or sometimes 
three services 12 or 24h apart during oestrus). 
Fertilization of the ova begins a few hours after 
mating after a necessary period of spermatozoa 
capacitation (Hunter, 1982). Fertilization rate is 
generally close to 100%. The rate of prenatal 
mortality in pigs is 30-40% on average. The 
largest part of losses (20-30%) occurs during 
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the first month of gestation, with an additional 
5-10% of fetal loss during late gestation 
(Wrathall, 1971; Christenson et a/., 1987). 

Farrowing lasts 3-5 h on average, with 
large variations between sows. It is initiated by 
the piglets, which produce corticosteroids, 
resulting in a production of prostaglandin by 
the placenta, which causes the regression of 
the corpora lutea and the initiation of the 
farrowing process. Parturition and the first few 
days after birth are critical periods for piglet 
survival. Up to 10% of piglets are stillborn in 
some populations (Holm et a/., 2004a; Canario 
et a/., 2007a), predominantly as a result of 
hypoxia during delivery. After birth, the time to 
get the udder, suckle and ingest an appropriate 
amount of colostrum and milk is a major deter¬ 
minant of piglet survival and growth. The sow 
has a strong control of milk ejection, with dura¬ 
tion of milk flow of 10-20 s, and an average 
nursing interval of less than an hour. Colostrum 
and milk production are difficult to measure 
directly, but can be indirectly estimated from 
piglet weight gain (Etienne et a/., 2000; 
Devillers et a/., 2004). Colostrum is essentially 
produced during the first 24h after parturition, 
and amounts to 4kg on average. Milk produc¬ 
tion peaks at around 21 days of lactation, and 
may reach 1kg daily for each piglet in sows 
nursing 10-12 piglets. With very few excep¬ 
tions, the lactating sow has a very limited fol¬ 
licular development, and does not ovulate or 
show any oestrus symptoms. The total removal 
of the sow from her litter at weaning normally 
results in an acceleration of follicular growth 
and in ovulation within 4-10 days. 


Male 

Spermatogenesis in the boar starts at 4-6 
months of age in most pig breeds, but may 
begin before 100 days of age in some early 
maturing breeds, such as the Chinese Meishan. 
Sperm quality and quantity then steadily 
increase with testicular development, testoster¬ 
one production and libido until sexual maturity 
at 6-8 months of age, and then at a much 
lower rate until boars reach their adult body 
size. A parallel rise in male accessory glands 
(the seminal vesicle, prostate and bulbo-urethral 
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glands), which produce 95% of the seminal 
plasma, results in a correlated increase in the 
volume of the ejaculate. Sexual activity is con¬ 
trolled by gonadotrophic hormones. FSH stim¬ 
ulates spermatogenesis, whereas LH stimulates 
steroid hormone (testosterone, but also other 
steroids such as androstenone) synthesis and 
secretion by the interstitial Leydig cells. The 
action of LH is dependent on the FSH induc¬ 
tion of LH receptors on the Leydig cells. Boar 
ejaculate is characterized by its large volume 
(around 300 ml on average) and spermatozoa 
number (80 to 120 billion when semen is col¬ 
lected once a week), which corresponds to total 
sperm reserves and widely exceeds daily sperm 
production (10 to 20 billion spermatozoa a 
day). As a consequence, spermatozoa number 
per ejaculate steadily decreases when the boar 
is used or ejaculate is collected more than once 
a week, in spite of a slight increase in sperm 
production with ejaculation frequency. Large 
amounts of spermatozoa and semen are neces¬ 
sary to ensure normal conception rate and pro¬ 
lificacy (50 ml of semen and 3 billion sperm are 
usually considered as minimum requirements 
for artificial insemination). Frozen boar semen 
can successfully be employed, with comparable 
or slightly lower conception rate and litter size 
than fresh semen, but it remains rather expen¬ 
sive and requires good technical expertise, so 
its commercial use remains quite limited. 


Between-breed Variation 
and Crossbreeding 

Breed differences 

Unlike production traits, breed comparisons 
for reproductive traits are not numerous owing 
to the size and costs of facilities that would be 
necessary to get accurate estimates of breed or 
line differences in a single environment. 
However, breed reproductive performance has 
until the mid-1990s remained almost 
unchanged, so that reasonably accurate esti¬ 
mates of breed differences could be obtained 
by compiling experimental results over time. 
The situation has radically changed over the 
last 15 years with the strong emphasis put on 
selection for litter size in pig dam lines and the 


large direct and correlative responses to selec¬ 
tion that have been obtained. Old breed com¬ 
parison results are no longer valid for traits 
related to litter size. For instance, the superior¬ 
ity of the Chinese Meishan breed over the 
Large White for litter size, which amounted to 
three to five piglets in the early 1990s (Bidanel, 
1993; Haley et a/., 1995), was reduced to 1.1 
piglets in 2005 (Canario et al., 2006b). In 
such cases, large-scale on-farm data recording 
systems are often the only way to get up-to- 
date information on breed performance. 
Though differences are purely phenotypic and 
may also reflect differences in average man¬ 
agement or environmental conditions, the data 
from Table 10.1 show the large differences 
that now exist between European breeds, as 
well as the differences in their ranking, with an 
advantage of Large White over Landrace in 
France and the reverse situation in Denmark. 

If the advantage of Meishan sows regard¬ 
ing prolificacy is probably lower than it used to 
be, these sows still give birth to litters that have 
a lower risk of stillbirth (Canario et a/., 2006b), 
in agreement with earlier studies (Blasco et aL, 
1995; Haley et a/., 1995). Similarly, Meishan 
females still perform better than Large White 
females for prenatal survival at a given ovula¬ 
tion rate (Bidanel et a/., 1990; Haley et al., 
1995) and for pre-weaning survival adjusted 
for litter size born (Lee and Haley, 1995), as 
selection has not improved prenatal or pre- 
weaning survival in the Large White breed 
(Tribout et a/., 2003; Canario et a/., 2007a). 
Meishan pigs have also been shown to reach 
puberty about 100 days earlier than Western 
breeds both in females (Despres et a/., 1992; 
White et al ., 1993) and in males (Frunier 
et al., 1987; Hochereau-de Reviers et al., 
1999), and to have a higher conception rate 
and a lower return to oestrus interval (Despres 
et al., 1992; White et al., 1993) than their 
Large White contemporaries. 

Many other local breeds have reproductive 
performances that widely differ, but are gener¬ 
ally lower than those of Large White or Landrace 
(Dobao et al., 1988; Gourdine et al., 2006); 
they will not be detailed here. Significant, 
although less important than with local breeds, 
differences in reproductive efficiency also exist 
between major commercial breeds. For instance, 
Landrace gilts have been shown to reach puberty 
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Table 10.1. Examples of comparative reproductive performance of some pig breeds. 3 


Trait 



Breed 


Reference 

Large White 

Dam line Sire line 

Landrace 

Duroc 

Pietrain 

Total number born 

14.2 

— 

14.6 

9.9 

— 

DSP, 2008 

Total number born 

14.8 

12.0 

13.7 

— 

10.0 


Number born alive 

13.6 

11.0 

12.5 

— 

9.3 

IFIP, 2009 

Number weaned 

11.4 

9.4 

10.8 

— 

7.8 



“Standard errors of breed means range from 0.03 to 0.1. 


earlier (Hutchens et a/., 1982; Allrich et a/., 
1985; Bidanel et a/., 1996), while having a 
slightly lower ovulation rate and a higher prena¬ 
tal survival rate than Large White gilts (Bidanel 
et a/., 1996). Breed differences in age at sexual 
maturity and in semen production traits have 
also been suggested in boars (Babol et a/., 
2004; Smital et a/., 2004); these variations 
could be related to differences in androstenone 
and skatole levels, the two major components of 
boar taint (Babol et a/., 2004). In addition to a 
lower prolificacy, Hampshire, Pietrain and 
Duroc breeds tend to have a lower conception 
rate (Sonderman and Luebbe, 2008) and mater¬ 
nal abilities, as shown by higher pre-weaning 
mortality rates compared with Large White or 
Landrace breeds (Blasco et a/., 1995). 


Crossbreeding 

Crossbreeding has for decades been known as 
an effective means of enhancing reproductive 
efficiency, in particular because of ‘hybrid vig¬ 
our’, or heterosis. For traits such as litter traits, 
which are controlled by both piglets and sow 
genotypes, enhanced performance may come 
from both crossed piglets (i.e. litter, direct or 
individual heterosis effects) and crossbred dams 
(i.e. sow or maternal heterosis effects). A large 
number of crossbreeding experiments carried 
out in the 1970s and 1980s have provided a 
rather good picture of heterosis effects in pigs 
(Table 10.2). Most reproductive traits exhibit 
significant heterosis, the largest effects being 
associated with sow genes. Indeed, as compared 
with purebreds, crossbred sows reach puberty 


earlier, have a higher conception rate, a slightly 
larger ovulation rate and better embryo/fetal 
survival rates, and farrow larger litters and have 
better nursing abilities, i.e. they raise piglets that 
have a higher probability of survival and grow 
faster than piglets nursed by purebred sows. 
Piglet heterosis leads to slightly larger litter size 
at birth and, above all, to higher piglet growth 
and survival during the nursing period. 

Heterosis effects vary according to breed 
combination. For instance, Large White x 
Landrace crosses generally exhibit somewhat 
lower heterosis values than other crosses 
between Western breeds. Yet crosses between 
Western and Chinese local breeds are the most 
striking illustration of breed combination differ¬ 
ences in heterosis, with effects on sow heterosis 
being up to five times larger than the values usu¬ 
ally encountered between European and/or 
American breeds (Table 10.2). 

Several experiments have compared 
crossbred with purebred boars (see review of 
Buchanan, 1987). Crossbred boars reach sex¬ 
ual maturity earlier than purebreds. As a con¬ 
sequence, young crossbred males have larger 
testes size and a higher sperm production than 
their purebred counterparts. They also have 
higher conception rates and tend to have more 
libido and to be more aggressive than purebred 
boars. This superiority then tends to diminish 
as boars mature, but it is not clear whether it is 
completely eliminated in adult animals. Finally, 
differences in litter size sired by crossbred ver¬ 
sus purebred boars are generally small and not 
significant (Buchanan, 1987). 

It has to be emphasized that the improved 
reproductive performance of crossbred sows 
is a major reason for the generalized use of 
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Table 10.2. Average heterosis effects for reproductive traits. Adapted from Rothschild and Bidanel (1998) 


Cross between European/American Cross between Large White and 

breeds Meishan breeds 



two-generation crossbreeding schemes in pig 
production. Indeed, most individual pig pro¬ 
ducers practise crossbreeding through the 
use of specialized paternal and maternal geno¬ 
types (Moav, 1966; McLaren et ai, 1987). In 
most cases, the maternal genotype is an F 1 
sow, but both more complex schemes using 
great-grandparental populations to produce 
three-way parental sows and simple partial 
rotational crossbreeding schemes are also used 
at a significant level. 


Within-breed Genetic Variability 
Components of genetic variation 

Estimates of heritability for the major female 
and male reproductive traits are summarized in 
Table 10.3. Female reproductive traits have 


low-to-moderate heritabilities. The most herit¬ 
able traits are those depending solely on the 
genotype of the female, i.e. age at puberty, at 
first mating or farrowing, ovulation rate, wean¬ 
ing to oestrus interval and measures of the 
intensity of oestrus symptoms. Fertility and pro¬ 
lificacy traits, i.e. conception or farrowing rate, 
litter size, piglet survival rates and, to a lesser 
extent, litter weight, which result from complex 
interactions between sow, boar and embryo or 
piglet genotypes, have low heritabilities and are 
more difficult to improve through selection. 

Heritability value estimates for male traits 
are much less numerous than for female traits. 
As most body composition traits, testes and 
accessory gland measurements, which are indica¬ 
tors of the sexual development of young boars, 
have moderate-to-high heritability values and are 
expected to easily respond to selection. Sperm 
characteristics have low-to-moderate heritabili¬ 
ties when measured on individual ejaculates 
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Table 10.3. Heritability (/t 2 ) estimates for female and male reproductive traits in the pig. Adapted from 
Rothschild and Bidanel (1998). 


Trait No. of estimates Mean If Range 


Female traits Age at puberty 

Oestrus symptoms 
Ovulation rate 
Conception rate 
Prenatal survival rate 
Total number born 
Number born alive 
Number weaned 
Farrowing survival rate 
Birth to weaning survival rate 
Farrowing length 
Litter homogeneity at birth 
Litter weight at birth 
Birth assistance 
Litter weight at 21 days 
Weaning to oestrus interval 
Rebreeding interval 
Male traits Testis width 

Testis weight 
Semen volume 
Sperm concentration 
Sperm motility 
% Abnormal sperm 
Libido 


16 

0.37 

0-0.73 

3 

0.21 

0.09-0.29 

18 

0.32 

0.10-0.59 

3 

0.10 

0-0.29 

12 

0.15 

0-0.23 

103 

0.11 

0-0.76 

118 

0.10 

0-0.66 

54 

0.08 

0-1.0 

12 

0.07 

0.03-0.14 

12 

0.05 

0-0.13 

2 

0.07 

0.05-0.10 

6 

0.08 

0.03-0.1 

18 

0.24 

0-0.54 

2 

0.05 

0.05 

22 

0.14 

0.07-0.38 

5 

0.22 

0.11-0.36 

3 

0.23 

0.03-0.36 

8 

0.37 

0.02-0.61 

5 

0.44 

0.24-0.73 

6 

0.19 

0.14-0.25 

6 

0.19 

0.13-0.26 

6 

0.11 

0.06-0.18 

4 

0.10 

0.06-0.17 

13 

0.15 

0.03-0.47 


(Grandjot et a/., 1997a; Wolf, 2009). The much 
higher heritability values reported in some studies 
(Smital et a/., 2005) are based on mean values 
per boar (averaged over ejaculates). Levels of 
steroid hormones have also been found to be 
moderately-to-highly heritable. Realized heritabil¬ 
ity values of 0.15 and 0.18 were obtained in lines 
divergently selected for basal testosterone levels 
(Robison et a/., 1994). More numerous estimates 
are available for the levels of androstenone, a 
pheromone that accumulates in fat tissues of 
males and is a major component of boar taint. 
Heritability values are high, around 0.55 on aver¬ 
age (Robic et a/., 2008). 

Some authors have quantified the relative 
importance of maternal, paternal and embryo/ 
piglet effects on genetic variation in litter traits. 
They all have confirmed that the largest part of 
genetic variation is due to sow genes, except 
for fertility, where similarly low paternal and 
maternal heritabilities were described by 
Varona and Noguera (2001). The service sire 
has been shown to have a rather limited effect 
on litter size, only 1-5% of phenotypic variance 


(Beauvois et ai, 1997; Van der Lende et a/., 
1999; Serenius et a/., 2003; Hamann et a/., 
2004; Su et a/., 2007). However, several stud¬ 
ies conclude that taking this limited male influ¬ 
ence into account improves genetic evaluation 
models for litter size (Serenius et ai, 2003), 
and even gives the opportunity to detect boars 
as potential carriers of chromosomal abnor¬ 
malities (Tribout et a/., 2000). 

Embryo/piglet genes also potentially influ¬ 
ence their own survival and growth. Most esti¬ 
mates of direct genetic effects on piglet survival 
at birth or during the nursing period are low 
(below 0.05) compared with maternal effects 
(Knol et a/., 2002a; Lund et a/., 2002; Ibanez- 
Escriche et a/., 2009), although somewhat 
higher estimates (around 0.10) have been 
reported in some studies or populations (Van 
Arendonk et al. , 1996). Similarly, growth dur¬ 
ing gestation is prominently due to the sow’s 
genes, with two to five times higher heritability 
values for maternal (0.17-0.26) than for direct 
(0.03-0.10) effects on piglet birth weight 
(Roehe, 1999; Knol et ai, 2002a; Solanes 
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et a/., 2004; Rosendo et ai, 2007b; Roehe 
et ai, 2009). The influence of piglet genes 
then increases, but remains lower than mater¬ 
nal genetic effects as far as piglet growth mainly 
depends on sow milk, i.e. until 3 to 4 weeks of 
age (Kaufmann et a/., 2000; Solanes et ai, 
2004; Rosendo et ai, 2007b). 

Maternal effects strongly reduce or even 
vanish after weaning. Yet several authors have 
suggested an effect of the pre-weaning environ¬ 
ment provided by the female’s dam on litter traits, 
which would reduce the efficiency of selection for 
litter size. Indeed, significant maternal genetic 
effects have been reported by some authors 
(Southwood and Kennedy, 1990; Ferraz and 
Johnson, 1993; Irgang et ai, 1994), although 
low estimates were reported in other studies 
(Perez-Enciso and Gianola, 1992; Chen et ai, 
2003a). Such variations may reflect differences 
between populations or management conditions 
(e.g. cross-fostering or age at weaning). In any 
case, maternal effects, if present, should be con¬ 
sidered in genetic evaluation for prolificacy 
(Roehe and Kennedy, 1993). Similarly, the 
hypothesis that successive litters might have a 
somewhat different genetic determination has 
been put forward to explain lower than expected 
response to selection for litter size in some popu¬ 
lations. Indeed, low genetic correlations were 
reported in several early studies. However, as 
emphasized by Haley et ai (1988), estimates 
were likely to be biased downward owing to cull¬ 
ing. More recent studies using statistical methods 
that account for the selection bias have given a 
clearer picture of the situation. Genetic correla¬ 
tions between adult, i.e. third and later parities, 
performances remain very high in all studies 
(>0.8), whereas much lower values have been 
obtained between first and, to a lesser extent, 
second parities, compared with later parity 
records (Irgang et a/., 1994; Roehe and Kennedy, 
1995; Hanenberg et ai, 2001; Noguera et ai, 
2002). Values in commercial herds with poorer 
environments are generally lower. 

Genetic correlations between 
reproduction traits 

Phenotypic and genetic correlations between 
male genital tract measurements are generally 
large (Toelle et ai, 1984; Sellier et ai, 2000). 
Testes measurements are favourably related to 


sperm production or percentage spermatogen¬ 
esis (Toelle et ai, 1984; Young et ai, 1986; 
Huang and Johnson, 1996). An increased 
daily sperm production, a higher sperm con¬ 
centration, a greater number of spermatozoa 
per ejaculate and a lower proportion of abnor¬ 
mal cells were found in a line selected for size 
of the testes at 150 days of age (Rathje et ai, 
1995; Huang and Johnson, 1996). Divergent 
selection of boars for testosterone production 
was also effective, with realized heritabilities of 
0.15 and 0.24 in the low and high lines, 
respectively, and significant line differences in 
testicular size, epididymal weight and volume 
density of Leydig cells, but without any change 
in daily sperm production per gram of testes 
(Robison et ai, 1994; Walker et ai, 2004). 
Estimates of the genetic parameters of sperm 
characteristics are not very numerous 
(Falkenberg et ai, 1988; Grandjot et ai, 
1997b; Smital et ai, 2005; Wolf, 2009), but 

clearly show a strong negative correlation 
between semen volume and sperm concentra¬ 
tion, as well as between sperm motility and the 
proportion of abnormal cells. Conversely, 
motility appears as nearly independent from 
both sperm volume and concentration, except 
in the study of Falkenberg et ai (1988). Finally, 
a five-generation divergent selection experi¬ 
ment on fat androstenone level (Willeke et ai, 
1987) resulted in a significant direct response, 
mainly in the high line, as well as in large 
significant correlated responses on plasma tes¬ 
tosterone and conjugated oestrogen levels. 

Genetic relationships between male and 
female reproductive traits have been estimated 
in some studies to examine the interest of using 
male traits as indirect selection criteria to 
improve female reproductive performance. 
Estimates of genetic correlations between tes¬ 
tes measurements and age at first oestrus, ovu¬ 
lation rate or litter size are generally low and do 
not show any consistent trend (Schinckel etai, 
1983; Toelle and Robison, 1985; Young et ai., 
1986; Johnson et ai, 1994). Conversely, 
Smital et ai (2005) reported significantly neg¬ 
ative and positive genetic correlations (0.3-0.4) 
between, respectively, sperm volume and 
sperm motility of boars and the litter size of 
their daughters. Selection for low-fat andros¬ 
tenone levels was also shown to result in a 
delayed puberty of females in the selection 
experiment of Sellier and Bonneau (1988). 
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Phenotypic and genetic correlations 
between several female traits are shown in 
Table 10.4. Age at puberty has, on average, 
close to zero relationships with ovulation rate 
and litter size at birth and at weaning, and 
weak negative, i.e. favourable correlations, 
with litter weights. Mean values for genetic cor¬ 
relations between age at first mating and sow 
fertility or longevity are also low (<0.10 in 
absolute value). However, these average values 
hide noticeable differences between studies 
(Ruiz-Flores and Johnson, 2001), which are 
partly due to the limited accuracy of genetic 
parameter estimates, but also probably result 
from both differences in the genetic make-up 
of pig populations and differences in manage¬ 
ment practices, e.g. mean age at first service or 
feeding level, which may affect the magnitude 
of competition effects between growth and 
reproduction for resource allocation. Results 
are more homogeneous regarding the associa¬ 
tion between age at first service and weaning 
to oestrus interval, which appears as clearly 
positive in the different studies investigating 
this relationship (Hanenberg et a/., 2001; 
Holm et a/., 2005; Imboonta et a/., 2007). 
Finally, nearly zero genetic correlations 
between age at puberty and traits characteriz¬ 
ing the intensity of oestrus symptoms were 
obtained by Rydhmer et al. (1994). 

Studies on relationships between rebreed¬ 
ing performance and other reproductive traits 
are not numerous in the literature. Most studies 


reported close to zero genetic correlations 
between weaning to service interval and litter 
size at birth (Hanenberg et a/., 2001; Holm 
et a/., 2005; Imboonta et al., 2007; Lundgren 
et al., 2010). No measurable correlative 
responses to selection for a reduced weaning to 
oestrus interval were reported (ten Napel et al., 
1998). Yet some authors (Sterning et al., 1990; 
Lundgren et al., 2010) have suggested a poten¬ 
tial antagonism between the investment of sows 
to raise their litter and their subsequent rebreed¬ 
ing performance. Though evidence for an 
antagonism between performance and fertility 
is still limited in pigs - only Lundgren et al. 
(2010) reported a negative, but not significant, 
genetic correlation between the interval from 
weaning to oestrus and litter weight gain - it is 
well documented in other species with high per¬ 
formance levels such as dairy cattle and should 
receive some attention in the future. 

Unlike fertility traits, the genetic determi¬ 
nation of prolificacy traits is now rather well 
documented. On average, ovulation rate has a 
moderate genetic antagonism (negative correla¬ 
tion) with prenatal survival (r g = -0.36, Table 
10.4), so that selection for ovulation rate results 
in limited correlative improvement in litter size 
at birth (Cunningham et al., 1979; Rosendo 
et al., 2007a). Several selection experiments 
have been implemented to increase litter size 
using different indirect criteria: an index of ovu¬ 
lation rate and embryo/prenatal survival (Johnson 
et al., 1984; Ruiz-Flores and Johnson, 2001; 


Table 10.4. Means of literature estimates of genetic and phenotypic correlations 3 among reproductive traits. 



AP 

OR 

PSR 

TNB 

FSR 

NBA 

BWSR 

NW 

LBW 

L21W 

ABW 

AP 


-0.06 

-0.08 

-0.04 

— 

0.07 

— 

0.09 

-0.10 

-0.15 

— 

OR 

0.05 


-0.36 

0.32 

-0.27 

0.24 

-0.38 

0.01 

0.24 

0.03 

-0.23 

PSR 

-0.01 

0.14 


0.50 

0.3 

0.55 

-0.25 

0.42 

0.30 

0.10 

-0.41 

TNB 

-0.03 

0.13 

0.60 


-0.25 

0.92 

-0.15 

0.73 

0.62 

0.40 

-0.41 

FSR 

— 

0.06 

-0.15 

-0.08 


0.01 

0.17 

-0.01 

-0.10 

0.05 

0.22 

NBA 

-0.03 

0.12 

0.40 

0.91 

0.15 


-0.14 

0.81 

0.64 

0.55 

-0.34 

BWSR 

— 

-0.11 

-0.14 

-0.12 

0.08 

-0.22 


0.15 

-0.07 

0.65 

0.15 

NW 

-0.01 

0.03 

0.36 

0.71 

0.47 

0.79 

0.55 


0.67 

0.81 

-0.23 

LBW 

-0.03 

0.07 

0.55 

0.79 

0.43 

0.82 

0.09 

0.71 


0.65 

0.43 

L21W 

-0.04 

0.02 

0.08 

0.42 

0.36 

0.46 

0.65 

0.80 

0.61 


0.60 

ABW 

— 

-0.17 

-0.32 

-0.40 


-0.44 


-0.17 

0.10 

0.07 



a Genetic correlations above the diagonal, phenotypic correlations below. 

AP, age at puberty; OR, ovulation rate; PSR, prenatal survival rate;TNB, total number born; FSR, farrowing survival rate; 
NBA, number born alive; BWSR, birth to weaning survival rate; NW, number weaned; LBW, litter weight at birth; L21W, 
21-day litter weight; ABW, average birth weight. 
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Rosendo et a/., 2007a), uterine capacity 
(Christenson et a/., 1987) or placental effi¬ 
ciency (Wilson et a/., 1999). Significant 

improvements in litter size have been obtained 
in several experiments (Ruiz-Flores and 
Johnson, 2001; Rosendo et a/., 2007a), but 
none of them outperformed direct selection for 
litter size. Interestingly, results from successful 
selection for litter size show no improvement in 
prenatal survival and a proportional increase in 
ovulation rate (Driancourt et a/., 1992; Tribout 
et a/., 2003). 

Genetic correlations between litter size at 
birth and at weaning are strongly positive 
(>0.71, Table 10.4). Conversely, litter size 
traits have some antagonistic relationships with 
piglet survival. In particular, farrowing survival 
rate is unfavourably correlated with ovulation 
rate (r = -0.27) and total number born (r = 
-0.25), but independent of number born alive 
and number weaned. Birth to weaning survival 
rate is negatively linked with ovulation rate and 
litter size at birth, and has a slightly positive 
correlation with the number of piglets weaned. 
Responses to selection are consistent with 
genetic parameters. Selection for ovulation 
rate or total number born has been shown to 
result in an increase in the number of stillborn 
piglets (Blasco et a/., 1995; Ruiz-Flores and 
Johnson, 2001; Canario et a/., 2007a; Guery 
et a/., 2009), whereas no deterioration of far¬ 
rowing survival is observed when selecting for 
number born alive (Guery et a/., 2009). 

Average birth weight appears to have neg¬ 
ative genetic correlations with litter size and 
positive ones with survival rates and litter 
weights. Yet the relevance of increasing piglet 
birth weight to improve survival is strongly 
debated, as it would be associated with a higher 
nutritional demand from the litter at the end of 
gestation and during lactation, which may be 
detrimental to the sow (Bergsma et a/., 2008). 
Moreover, bigger piglets might be associated 
with increased farrowing difficulties (Canario 
et a/., 2006b). Some authors have suggested that 
improving the homogeneity of piglet weight at 
birth would result in higher piglet survival. 
Indeed, several studies have reported positive 
relationships between within-litter variation in 
birth weight and pre-weaning mortality (Roehe 
and Kalm 2000; Knol et a/., 2002b; Milligan 
et a/., 2002; Huby et a/., 2003). Additionally, 


results from a divergent selection experiment 
for the homogeneity of birth weight in rabbits 
showed a significant direct response to selec¬ 
tion and lower farrowing (16.6% versus 18.6%) 
and birth to weaning (17.7% versus 32.7%) 
mortality rates of kits in the ‘homogeneous’ 
than in the ‘heterogeneous’ line (Garreau 
et a/., 2008). However, the efficiency of such 
a criterion has been questioned by some authors 
owing to the important amount of work repre¬ 
sented by the individual weighing of piglets and 
to the low heritability obtained in some pig pop¬ 
ulations (Wolf et a/., 2008). Piglet maturity at 
birth has been shown to be another important 
issue in piglet survival. Indeed, embryos/piglets 
from Meishan dams have been shown to have a 
higher probability of survival and to be more 
mature at birth than embryos/piglets from 
European and/or American dams (Le Dividich 
et a/., 1991; Canario et a/., 2007b). Moreover, 
selection for lean tissue growth rate has been 
shown to reduce piglet maturity at birth (Herpin 
et a/., 1993; Canario et a/., 2007b). Yet finding 
simple criteria to characterize piglet maturity 
currently remains a challenge. 

Farrowing kinetics is another important 
determinant of farrowing survival. The proba¬ 
bility of stillbirth has been shown to be increased 
(Holm et a/., 2004a; Canario et a/., 2006a) in 
prolonged farrowings or with longer birth inter¬ 
vals because piglets are more prone to be 
asphyxiated or suffer some degree of hypoxia. 
Longer farrowings may also affect birth to 
weaning survival, as piglets that have suffered 
from hypoxia take a longer time to reach the 
udder and have a higher probability of being 
crushed or suffering from hypothermia. After 
birth, piglet survival and growth largely depend 
on the ability of females to take care of their 
progeny and to provide a sufficient amount of 
colostrum and milk. Sow behavioural traits are 
lowly-to-moderately heritable depending on 
test standardization conditions (Grandinson 
et a/., 2003; Hellbrugge et a/., 2008) and may 
affect piglet survival. As reviewed by Grandinson 
(2005), sows with a high responsiveness 
towards their offspring, a low fear response 
towards humans and a high frequency of suc¬ 
cessful nursings tend to have fewer piglet 
losses. Colostrum consumption, which pro¬ 
vides the piglet with both energy and maternal 
antibodies, and can be indirectly estimated 
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from piglet growth rate over the first 24 h of 
life (Devillers et al., 2004), has also been 
shown to be an important determinant of piglet 
survival (Le Dividich et a/., 2005). However, 
contrary to milk production, colostrum produc¬ 
tion was found to remain unchanged after 
21 years of selection in French Large White 
breed pigs (Canario, 2006). 

Milk production is difficult to record, but 
can be indirectly estimated from litter growth 
rate (Etienne et a/., 2000); production is influ¬ 
enced by the stimuli from the suckling piglet 
and has increased as a consequence of higher 
sow prolificacy and more active piglets 
(Mackenzie and Revell, 1998; Canario, 2006). 
As selection has led to a strong reduction of 
sow body fat reserves and has had limited 
effects on lactational sow feed intake (Tribout 
et ai, 2003), it has probably resulted in a more 
pronounced negative energy balance of sows 
during lactation, with potentially unfavourable 
effects on health, reproduction and longevity 
(Bergsma et ai, 2008). This negative energy 
balance could be limited by increasing lacta¬ 
tional sow feed intake, by decreasing piglet 
early growth potential or by increasing lacta¬ 
tion efficiency - defined as the ratio of energy 
output for lactation to sow energy input 
(Bergsma et ai, 2008). A last important issue 
is the number of functional teats, which can be 
a limit to the number of piglets a sow can 
nurse. Teat number has been shown to have a 
moderate heritability, to be genetically inde¬ 
pendent of production traits and litter size at 
birth, and to have a positive genetic correlation 
with litter size at weaning (Ligonesche et a/., 
1995; Zhang et a/., 2000). 


Genetic correlations with other 
economically important traits 

Estimates of genetic correlations of reproduc¬ 
tion with growth and carcass traits are rather 
numerous in the literature. In males, testes 
measurements have been shown to have posi¬ 
tive genetic correlations with growth traits 
when measured at a constant age (Toelle et ai, 
1984; Young et a/., 1986; Lubritz et a/., 
1991; Johnson et a/., 1994). Genetic correla¬ 
tions are more variable when measurements 


occur at a constant weight, as both positive 
(Schinckel et a/., 1983) and negative (Sellier 
et ai, 2000) estimates have been obtained. 
Growth rate has also been found to be posi¬ 
tively correlated with plasma testosterone level 
(Lubritz et a/., 1991) and to have positive or 
null associations with fat androstenone levels 
(Willeke and Pirchner, 1989). Genetic correla¬ 
tions of reproduction traits with backfat thick¬ 
ness are generally low (Toelle et ai, 1984; 
Sellier et a/., 2000). 

Age at puberty of gilts exhibits negative, 
i.e. favourable, genetic correlations with growth 
rate (Young et ai, 1978; Hutchens et ai, 
1981; Rydhmer et a/., 1992; Bidanel et ai, 
1996), and has weak relationships with backfat 
thickness or carcass lean content (Rydhmer et al ., 
1992, 1995; Bidanel et a/., 1996; Serenius et a/., 
2004). Results regarding relationships between 
fertility and production traits are much more 
limited. Rydhmer et al. (1994) obtained nega¬ 
tive correlations between traits characterizing 
the intensity of oestrus symptoms and growth 
rate, whereas relationships with carcass lean¬ 
ness were close to zero. Significant differences 
in conception rate were obtained between a 
line selected for high lean growth on scale 
feeding and a line selected for low lean growth 
(64% versus 83%) by Kerr and Cameron 
(1996). Adamec and Johnson (1997) and ten 
Napel et al. (1998) reported rather low genetic 
correlations (<0.20 in absolute value) between 
re-breeding performance and growth or carcass 
traits, but pointed out a risk of downward bias 
due to culling and mentioned that selection for 
litter weight gain and against backfat might 
result in prolonged intervals. 

Early literature reviews (Brien, 1986; Haley 
et al ., 1988) concluded that litter size and 
weights are, on average, weakly correlated with 
performance traits. Yet several more recent 
results suggest the existence of slightly unfavour¬ 
able genetic relations between the two sets of 
traits in populations with high performance lev¬ 
els. Negative genetic correlations between 
growth and litter size were estimated by Ducos 
and Bidanel (1996), Hermesch et al. (2000a), 
Zhang et al. (2000), Holm et al. (2004b) and 
Tribout and Bidanel (2008). The majority of 
more recent estimates of genetic relationships 
between litter traits and carcass lean content are 
slightly positive, i.e. unfavourable (e.g. Zhang et al., 
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2000; Chen et a/., 2003b; Serenius et al., 
2004; Imboonta et al., 2007). The situation is 
less clear for feed intake and feed efficiency 
because of a much lower number of estimates. 
Close to zero relationships were reported by 
Hermesch et al. (2000b), but significant corre¬ 
lated responses of litter size to selection for high 
and low feed intake and for low residual feed 
intake were obtained by, respectively, Kerr and 
Cameron (1995) and Sellier et al. (2010). 

With the exception of the above-mentioned 
relationships between fat androstenone level 
and sexual development, estimates of genetic 
correlations between reproduction and meat 
quality traits are not numerous in the literature. 
Most available estimates with meat pH, meat 
colour and meat water-holding capacity do not 
significantly differ from zero (Larzul et a/., 
1999; Hermesch et a/., 2000a; Serenius et ai, 
2004; Rosendo et al., 2010) and tend to indi¬ 
cate that the two groups of traits can reasona¬ 
bly be considered as almost genetically 
independent. 


Individual Genes and Quantitative 
Trait Loci (QTLs) Affecting 
Reproduction Traits 

The dramatic advances in molecular biology 
and genomics over the past 20 years have 
profoundly changed our knowledge of the 
genetic determination of economically import¬ 
ant traits in major livestock species. In particu¬ 
lar, the development of panels of genetic 
markers covering the whole genome has 
allowed the individual loci underlying the genetic 
variance of quantitative traits of economic 
importance to be systematically detected and 
mapped. A large number of experiments aim¬ 
ing at detecting these QTLs have led to the 
identification of a large number of such loci 
(more than 5600 QTLs are currently referenced 
in the PigQTLdb accessible at http://www. 
animalgenome.org/cgi-bin/QTLdb/SS/index 
(Hu et al., 2007). It has to be noted that these 
QTLs have essentially been detected using low- 
density maps based on panels of microsatellite 
markers and not the high-density linkage maps 
based on single nucleotide polymorphism (SNP) 
markers that have recently become available 


(Ramos et al., 2009). QTLs for male and 
female pig reproductive traits detected so far 
are summarized in Tables 10.5 and 10.6, 
respectively. The main functional candidate 
genes associated with female reproductive 
traits that have been investigated are given in 
Table 10.7. 

QTLs for male reproductive traits have 
been mapped using crosses between early 
maturing Chinese and European/American 
breeds. Most of these have a rather limited 
effect (less than 10% of trait variation), but a 
limited number of loci, located on chromosome 
X, explain up to 20% of the variance of several 
male reproductive traits, i.e. testicular and 
seminal vesicle weight, as well as seminiferous 
tubular diameter. Alleles from Chinese breeds 
increase testes weight at young ages, but tend 
to reduce weight at older ages. The SERPINA 7 
gene, which regulates the availability of thyroid 
hormones within tissues, has been reported 
as an interesting positional candidate by Ren 
et al. (2009), but its potential implication 
remains to be investigated. No fine mapping of 
autosomal QTLs has been carried out as yet, 
and the localization interval of QTLs remains 
large (generally above 20 cM). 

QTLs for female reproductive traits are 
given in Table 10.6. QTLs for teat number, 
which are much more numerous than QTLs 
for true reproductive traits because its meas¬ 
urement is very easy, are not reported here. 
The 71 QTLs for teat number reported in 
PiGQTLdb are located on all chromosomes 
except SSC (Sus scrofa chromosome) 14 and 
18. They generally explain a limited propor¬ 
tion of trait variability, so that no fine-mapping 
programme has been developed. The most 
numerous QTLs then concern the most herit¬ 
able traits, i.e. age at puberty and ovulation 
rate (23 and 22 QTLs, respectively, on 
PiGQTLdb). QTLs for age at puberty have 
been mapped to ten different chromosomes, 
with overlapping localizations from independ¬ 
ent studies on SSC1, on SSC7 in the SLA 
(swine leucocyte antigen) complex region and 
at the extremity of the short arm of SSC8. 
QTLs for ovulation rate have been detected on 
nine different chromosomes, with QTLs from 
several independent studies located on two 
different regions of SSC8, the first one in the 
centromeric region (Wilkie et al., 1999) and 
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Table 10.5. Quantitative trait loci (QTLs) for male reproductive traits. 


Trait 

Pig chromosome 

Population 3 

Size 

% variance 

Reference 

Ejaculation duration 

6, 17 

DU x ER 

177 

71-7.9 

Xing etal., 2009 

Ejaculation times 

6, 16, 17 

DU x ER 

177 

5.9-11.8 

Xing et al., 2009 

Epididymal weight at: 






90 days of age 

2 

DU x ER 

347 

4.5 

Ren et al., 2009 

180 days of age 

3, 4, 10, 13, 15 

LWx MS 

487 

1.9-4.3 

Bidanel et al., 2001 

300 days of age 

3,7 

DU x ER 

347 

4.5-27.3 

Ren et al., 2009 

Length of bulbo-urethral 

1,3, 7, 13 

LWx MS 

485 

3.3-5.1 

Bidanel et al., 2001 

glands 






Plasma FSH level b 

3, 10, X 

WC x MS 

315 


Rohrer et al., 2001 

Plasma testosterone level 

7, 13 

DU x ER 

347 

7.3-14.3 

Ren et al., 2009 

Semen volume 

3, 15, 18 

DU x ER 

177 

7.9-8.6 

Xing et al., 2009 

Seminiferous tubular 






diameter at: 






90 days of age 

5, 13, 14, X 

DU x ER 

347 

8.4-14.8 

Ren et al., 2009 

300 days of age 

16 

DU x ER 

347 

14.8 

Ren et al., 2009 

Seminal vesicles weight 

1,3, 4, 7, 11, 

LWx MS 

481 

2.5-21.8 

Bidanel et al., 2001 


15, 16, X 





Sperm abnormality rate 

4,9 

DU x ER 

177 

8.8-11.8 

Xing et al., 2009 

Sperm concentration 

17 

DU x ER 

177 

9.5 

Xing et al., 2009 

Sperm motility 

4 

DU x ER 

177 

6.3 

Xing etal., 2009 

Sperm pH value 

2, 6,9 

DU x ER 

177 

5.7-9.8 

Xing etal., 2009 

Testicular weight at: 






60 days of age 

3, X 

DU x MS 

449 

5.0-9.0 

Sato etal., 2003 

90 days of age 

i,x 

DU x ER 

347 

9.1-20.6 

Ren et al., 2009 

180 days of age 

4, 7, 10, 13, 

LWx MS 

487 

3.5-19.6 

Bidanel et al., 2001 


17, X 





220 days of age 

X 

WC x MS 

315 


Rohrer et al., 2001 

300 days of age 

1,5, 7, X 

DU x ER 

347 

4.8-14.7 

Ren et al., 2009 


a DU, Duroc; ER, Erhualian; LW, Large White; MS, Meishan; WC, White European breed cross. 
b FSH, follicle stimulating hormone. 


the second one in the telomeric part of the 
chromosome short arm. Fine mapping of these 
two regions has been performed by, respec¬ 
tively, Braunschweig et al. (2001) and 
Campbell et al. (2003). The gene coding for 
mannosidase 2B2 (MAN2B2) was proposed 
by Campbell et al. (2008) as a positional can¬ 
didate, but no causative polymorphism has 
been identified so far. A C/G substitution in the 
3' UTR (three prime untranslated mRNA 
region) of a functional candidate, the GNRHRH 
gene, which is critical in the endocrine regula¬ 
tion of reproduction and is located in the cen- 
tromeric part of SSC8, was found to affect 
ovulation rate by Jiang et al. (2001). 

QTLs affecting litter size traits have been 
detected on 13 different chromosomes, but 
most of them are putative results and there is 
limited overlap between studies. Indeed, the 
only overlaps concern the results of Wilkie 


et al. (1999), Bidanel et al. (2001) and Li 
et al. (2009) on SSC6, of De Koning et al. 
(2001), Tribout et al. (2008) and Li et al. 
(2009) on SSC7, of Bidanel et al. (2001) and 
Noguera et al. (2009) on SSC13 and, finally, 
of De Koning et al. (2001) and Noguera et al. 
(2009) on SSC17. Some overlaps with candi¬ 
date gene studies have also recently begun to 
appear. The leptin receptor gene ( LEPR ), 
which has been shown to be associated with 
variations in litter size by Chen et al. (2004b), 
is located in the confidence interval of the 
above-mentioned QTLs on SSC6 (Wilkie 
et al ., 1999; Bidanel et al., 2008; Li et al., 
2009). The SSC7 QTLs (De Koning et al., 
2001; Tribout et al., 2008; Li et al., 2009) 
are located in the region of the properdin 
locus (CFP), which was found to be associated 
with litter size by Buske et al. (2005). The 
prolactin receptor locus ( PRLP• Vincent et al., 
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Table 10.6. Quantitative trait loci (QTLs) for female reproductive traits. 


Trait 

Pig chromosome 
number 

Population 3 

Size 

Variance (%) 

Reference 

Age at puberty 

1,4, 6, 7, 13 

LWx MS 

476 

3.0-10.0 

Bidanel et al., 2008 


7, 8, 12 

LWx LR 

295 

2.7-9.7 

Cassady et al., 2001 


15 

LWx LR 

295 


Holl et al., 2004 


1, 10 

WC x MS 

344 


Rohrer et al., 1999 


1,7, 8, 17 

DU x ER 

454 

2.0-8.0 

Yang et al., 2008 

Ovulation rate 

4, 5, 7, 9, 13 

LWx MS 

502 

3.9-5.9 

Bidanel et al., 2008 


9 

LWx LR 

295 

3.4 

Cassady et al., 2001 


4, 8, 13, 15 

LWx LR 

114 

5.1-10.9 

Rathje et al., 1997 


3, 8, 9, 10, 15 

WC x MS 

344 


Rohrer et al., 1999 


3 

DU x MS 

234 


Sato et al., 2006 


7, 8, 15 

YO x MS 

104 


Wilkie et al., 1999 

Number of embryos 

9, 12, 18 

LWx MS 

468 

2.8-7.2 

Bidanel et al., 2008 

Uterine capacity 

8 

WC x MS 

187 


Rohrer et al., 1999 

Gestation length 

1, 9, 15 

YO x MS 

104 

9.4-23.6 

Wilkie et al., 1999 

Number mummified 

2, 6, 12 

LWx LR 

279 


Holl et al., 2004 

Total number born 

11 

LWx LR 

279 

5.1 

Cassady et al., 2001 


7, 12, 14, 17 

LW/LR x MS 

269 

2.7-8.8 

De Koning et al., 2001 


8 

LWx MS 

152 


King et al., 2003 


7, 15 

DU x ER 

299 

2.8-4.3 

Li et al., 2009 


13, 17 

IBx MS 

881 


Noguera et al., 2009 


6 

YO x MS 

104 


Wilkie et al., 1999 

Number of stillborn 

5, 13 

LWx LR 

279 

7.9 

Cassady et al., 2001 


12, 14 

LWx LR 

279 


Holl et al., 2004 


7,8 

DU x ER 

299 

3.7-5 

Li eta!., 2009 


6, 11, 14 

LW, LR 



Tribout et al., 2008 


4 

YO x MS 

104 


Wilkie et al., 1999 

Number born alive 

11 

LWx LR 

279 


Cassady et al., 2001 


6, 15 

DU x ER 

299 

3.7-5 

Li eta!., 2009 


13, 17 

IBx MS 

881 


Noguera et al., 2009 


7, 16, 18 

LW, LR 



Tribout et al., 2008 


a DU, Duroc; ER, Erhualian; IB, Iberian pig; LR, Landrace; LW, Large White; MS, Meishan; WC, White European breed 
cross; YO, Yorkshire 


1998) is located on SSC16, close to the con¬ 
fidence interval boundary of the QTLs affect¬ 
ing number born alive detected by Tribout 
et al. (2008). Conversely, no QTL has so far 
been detected in the ESR1 gene region, a 
candidate gene known to be associated with 
litter size in several populations (Table 10.7). 
It is currently not known whether these candi¬ 
date genes are only additional markers associ¬ 
ated with the observed variability or are 
responsible for these variations. The exist¬ 
ence of non-significant results in some popu¬ 
lations would tend to indicate that the 
polymorphism investigated is not the causa¬ 
tive mutation. Although several non-significant 
results from very small populations have been 
reported in the literature (e.g. Drogemuller et ai, 


2001; Gibson et a/., 2002) for the ESR1 
gene, a bias from unpublished non-significant 
results is likely to exist. 

Finally, it should be emphasized that the 
simple additive models that have in most cases 
been used so far are likely to detect only a 
(small) fraction of the genetic variability of traits 
such as litter size that are known to be affected 
by non-additive gene effects. Indeed, no fewer 
than 18 epistatic QTLs affecting number born 
alive were detected by Noguera et ai (2009) 
using a bidimensional genome scan versus two 
QTLs using conventional one-dimensional 
scans. 

The effects on reproductive traits of some 
of the major genes identified so far have been 
investigated. The halothane sensitivity gene 
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Table 10.7. Candidate genes associated with female reproductive traits in pigs. Adapted from Buske 
et al., 2006a; Onteru etal., 2009. 




Polymorphism 



Genotyped pigs 


Gene 3 

SSC b (location) 

Trait 0 

Population 

(number) 

Reference(s) 

ESR1 

1 

Pvull site (intron) TNB, NBA 

MS x SL; LW 

161;1079 

Rothschild etal., 1996 




TNB, NBA, TN 

LW 

4262 

Short etal., 1997 




TNB, NBA 

LWx MS 

275 

Van Rens etal., 2002 




TNB, NBA 

LW 

1030 

Goliasova and Wolf, 







2004 




TNB, NBA 

LW 

226 

Horogh etal., 2005 



C/T (exon 5) 

TNB, NBA 

SL 

408 

Munoz etal., 2007 

PAX5 

1 

CAT (intron 9) 

AP 

DU/LR x CW 

376 

Kuehn etal., 2009 

FSHB 

2 

FSHBMS 

TNB, NBA 

YOx ER 

289 

Li etal., 1998 



microsatellite 
(5' flanking 
region) 

NW, LWW, GL 

LWx MS 


Li etal., 2008 

EPOR 

2 

Intron 4 

Uterine 

4-way cross 

402 

Vallet etal., 2005; 




capacity 



Nonneman etal., 
2006 

LEPR 

6 

Intron 2, exon 2, 

Litter size 

YO; DU 

62; 246 

Chen etal., 2004b 



exon 18 





FUT1 

6 

Exon 2 

TNB, NBA 

PBP 

104 

Horak et a!., 2005 




TNB, NBA 

(LW x LR) x 

123 

Buske etal., 2006b 





Le 



RNF4 

6 

C U (intron 5) 

TNB, NBA 

QP 

159 

Niu etal., 2009 

BF 

7 

Intron 1 

TNB, NBA 

(LW x LR) x 

123 

Buske etal., 2005 





Le 



GNRHR 

8 

3' UTR e 

OR 

MS x LW 

200 

Jiang etal., 2001 

OPN 

8 

Intron 

TNB, NBA 

SL 

519 

Korwin-Kossakowska 







etal., 2002 

LIF 

8 

Exon 3 

NBA 

LR; LW 

850; 604 

Spotter etal., 2009 

AKR1C2 

10 

lle16Phe (Ntl79 

AP, OR, TN 

Va MS 

191 

Nonneman etal., 2006 



in coding 
region) 





RBP4 

14 

(Intron) 

TNB, NBA 

SL 

1300 

Rothschild etal., 2000 




NBA 

LR; LW 

850; 604 

Spotter et al., 2009 

PRLR 

16 

Alu site 

TNB, NBA 

LW; MS; LR 

400; 261; 416 

Vincent et al., 1998 




NBA 

SL 

273 

Drogemuller etal., 2001 




AP, OR, TNB, 

MS x LW/ 

55-77 

Van Rens and Van der 




NBA 

LR 


Lende, 2002; Van 
Rens et al., 2003 

LEP 

18 

(Exon 3) 

TNB, NBA 

SL 

519 

Korwin-Kossakowska 







etal., 2002 



Exon 3 

Litter size 

YO; LR 

62; 170 

Chen et al., 2004a 



Intron 1 

Litter size 

DU 

246 

Chen et al., 2004a 


a ESR1, estrogen receptor 1; PAX5, paired box 5; FSHB, follicle stimulating hormone beta; EPOR, erythropoietin 
receptor; LEPR, leptin receptor; FUT1, fucosyl transferase 1; RNF4, ring finger protein 4; BF, properdin; GNRHR, 
gonadotrophin releasing hormone receptor; OPN, osteopontin; LIF, leukaemia inhibitory factor; AKR1C2, aldo keto 
reductase 1C2; RBP4, retinol binding protein 4; PRLR, prolactin receptor; LEP, leptin. 
b SSC, Sus scrota chromosome. 

C AP, age at puberty; GL, gestation length; LWW, litter weight at weaning; NBA, number born alive; NW, number weaned; 
OR, ovulation rate;TN, teat number; TNB, total number born. 

d DU, Duroc; ER, Erhualian; Le, Leicoma; LR, Landrace; LW, Large White; MS, Meishan; PBP, Prestice Black Pied; 

QP, Qingping; SL, Synthetic line; YO, Yorkshire. 
e 3' UTR, three prime untranslated mRNA region. 
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(HAL; Fujii et al., 1991) and the RN gene 
(Milan et a/., 2000) are generally considered 
to have limited effects on reproductive traits. 
Significant effects of the halothane gene 
region were reported in some early studies 
but, as discussed by Sellier et al. (1987), these 
are likely to be due to the presence of a QTL 
affecting reproduction in linkage disequilib¬ 
rium with the halothane locus. Indeed, QTLs 
affecting reproduction traits have been 
detected in the vicinity of both the HAL and 
the RN genes (Tables 10.5 and 10.6), so that 
apparent effects of these two genes on repro¬ 
duction would occur in the case of linkage dis¬ 
equilibrium with the QTLs concerned. Such 
disequilibrium could be an explanation of the 
effect of the IGF2- Intron3-G3072A mutation 
(Van Laere et a/., 2003) on litter size reported 
by Buys et al. (2006). 


Gene Expression Results 

The possibility of large-scale gene expression 
analysis is a useful complementary approach 
to understand the biological basis of reproduc¬ 
tive function. For instance, Bonnet et al. 
(2008) tried to identify the genes differentially 
expressed in pig granulosa cells along the ter¬ 
minal ovarian follicle growth. They showed in 
particular the down-expression of ribosomal 
protein, cell morphology and ion-binding 
genes, and the differential expression of genes 
involved in lipid metabolism. Ross et al. 
(2009) investigated gene expression during 
porcine conceptus rapid trophoblastic elonga¬ 
tion and attachment to the uterine luminal 
epithelium. When comparing filamentous day 
12 conceptuses with large spherical concep- 
tuses, as many as 482 genes were statistically 
different, with a greater than twofold change 
in expression. The genes represented a large 
number of biological processes associated 
with cell motility, ATP utilization, cell growth, 
metabolism and intracellular transport. 
Expression analysis of genetic variation has so 
far been limited. A transcriptome analysis of 
pig folliculogenesis was performed on pigs 
from lines selected for reproductive traits by 
Caetano et al. (2004) and Gladney et al. 
(2004). These studies showed that genes 


involved in steroid biosynthesis (cytochrome 
P450 side chain cleavage enzyme, steroido¬ 
genic acute regulatory protein), tissue remod¬ 
elling (plasminogen activator inhibitor III) and 
apoptosis (calpain light chain I) were differen¬ 
tially expressed between lines. Approaches 
such as eQTL (expression quantitative trait 
locus) analyses have not yet been used to 
investigate pig reproduction owing to cost 
considerations, but should be of great interest 
in deciphering the genetic variability of this 
complex function. 


Conclusions 

Large gains in pig reproductive efficiency 
have been achieved over the last two decades. 
Sows from the highest performing maternal 
lines now farrow almost 15 piglets a litter, i.e. 
30% more than 20 years ago. This large 
improvement has often been necessary to 
allow breeders to maintain some profit from 
their activities. It has been accompanied by 
some adverse effects such as increased piglet 
mortality, and also raises questions about the 
ability of sows nursing very large litters with¬ 
out impairing their health, their longevity and 
their welfare. Much remains to be known in 
order to understand the physiological and 
genetic basis of the complex interactions 
between the sow and its litter, and to find 
ways to improve lactation efficiency without 
impairing sow longevity and piglet post- 
weaning growth. The new genomic revolution 
associated with the sequencing of the pig 
genome and the availability of new tools such 
as high-density SNP chips and new sequenc¬ 
ing technologies give scientists and breeders 
the opportunity to benefit from the use of 
much more powerful methods to understand 
the biological bases of genetic variation and to 
manage livestock populations more efficiently. 
In particular, genomic selection tools will 
undoubtedly result in more efficient genetic 
improvement programmes and potentially 
give scientists huge amounts of data to deci¬ 
pher trait genomic variability, provided that 
more accurate phenotypes are available to 
understand the complex relationships between 
genotype and phenotype. 
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Introduction 

The emergence and utilization of new reproduc¬ 
tive management strategies and tools have had a 
substantial impact on reproductive capabilities 
for the swine industry and for research labs using 
pigs as research models. Artificial insemination 

242 


(AI) in pigs has significantly extended the genetic 
resources available to commercial and seed-stock 
producers as well as researchers on a global 
scale. In addition to AI, further development and 
commercial utilization of other techniques such 
as sperm and embryo cryopreservation, deep- 
intrauterine insemination and surgical embryo 
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transfer (ET) will enable producers and researchers 
to address continuing and emerging problems 
affecting pig production and research. Another 
area that has dramatically changed in recent 
years is the use of cloning of and transgenic 
modification to pigs. Transgenic modifications to 
pigs are occurring at a rapid pace and are signifi¬ 
cantly enhancing the pig’s potential for use in 
both production agriculture and biomedical 
disease modelling, while cloning permits the 
reproduction of animals demonstrating rare 
phenotypes. The rapid progression of transgenic 
modifications to pigs is largely a result of signifi¬ 
cant advancements in somatic cell nuclear 
transfer (SCNT). As SCNT efficiency continues 
to improve, novel strategies for genetic modifica¬ 
tions to mammalian genomes will provide 
additional methods for studying conditional 
‘knockouts’ and multiple ‘knock-ins’ to better 
understand the physiology of the pig and specific 
gene function. This knowledge will result in more 
efficient use of the pig for agriculture, biomedi¬ 
cine and disease modelling. 


Modern Reproductive Technologies 

In any livestock venture, reproductive perform¬ 
ance is often the number one contributor to 
profitability. Improved reproductive perform¬ 
ance requires an understanding and utilization 
of innovative approaches to the synchronization 
of breeding and farrowing, while maintaining 
adequate litter size and maximizing the contribu¬ 
tion of elite genetics to future progeny. Below 
we have outlined a number of reproductive 
technologies that are currently being applied, as 
well as discussing where the science is going. 


Oestrus synchronization 

Synchronization of oestrus in swine herds is 
essential to maintaining large farrowing groups 
and has permitted efficiencies in production, 
management and marketing. The most com¬ 
mon method of oestrus synchronization in sows 
is weaning litters at the same time. Generally, 
most of the sows will begin oestrus within 4-7 
days following weaning. This method is widely 
used throughout the swine industry and is 


consistently reliable in sows. Gilts, in contrast, 
represent a significant proportion of the breed¬ 
ing inventories for most swine herds, and syn¬ 
chronizing gilt oestrus can be more challenging. 
One reliable method of synchronizing oestrous 
cycles in gilts is the synchronization of puberty 
onset. A well-established approach is the injection 
of PG 600 to hormonally induce follicular growth 
in prepubertal gilts (Guthrie, 1977); this can also 
be effective in acyclic post-pubertal females. 

In cattle, a single injection of exogenous 
prostaglandin F2a effectively regresses the 
corpus luteum (CL), providing a premature 
return to oestrus. However, in pigs the CLs are 
largely unresponsive to a single dose of pros¬ 
taglandin F2a and multiple injections are only 
effective during the later stages of dioestrus 
(Hallford et a/., 1975; Estill et a/., 1993). In 
the absence of competent embryos, the pig CL 
normally initiates regression around days 
15-18. The requirements of multiple injections 
that have a short window of effectiveness make 
this a cost-prohibitive strategy. 

While luteolytic agents are not widely used 
in the swine industry, the use of progesterone 
receptor agonists has become the method of 
choice in manipulating the oestrous cycles of 
females. The most common strategies involve 
the use of altrenogest (Matrix®). The majority of 
randomly cycling gilts fed 15mg/head of altre¬ 
nogest daily for approximately 14 days will 
come into oestrus 5-6 days after withdrawal of 
the altrenogest (Stevenson and Davis, 1982). 
The use of PG 600 (400 and 200 international 
units of pregnant mare serum gonadotrophin 
(PMSG) and human chorionic gonadotrophin 
(HCG), respectively) following oestrus synchro¬ 
nization with altrenogest can increase the 
number of ovulations by about eight, and short¬ 
ens the altrenogest withdrawal to oestrus inter¬ 
val by about 24h (Estienne et a/., 2001). 
However, some evidence suggests that the addi¬ 
tional embryos produced as a result of PMSG 
and HCG stimulation have compromised devel¬ 
opmental ability (Ziecik et a/., 2005). 


Breeding strategies 

AI has dramatically advanced the genetic 
improvement of pigs and is the standard 
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operating procedure on major swine farms. 
The downside of AI in pigs is that the long uter¬ 
ine horns of females require a volume and 
sperm count that limits the boar to producing 
an average of 20-30 AI doses per ejaculate. To 
overcome this, investigators have been devel¬ 
oping approaches to insemination that can 
reduce both the volume and sperm numbers 
required per dose. These approaches require 
the delivery of the semen past the cervix (i.e. 
post-cervical or intrauterine insemination) or 
further into the uterine horns (deep intrauterine 
insemination, also referred to as low-dose 
insemination) (Martinez et a/., 2001; Watson 
and Behan, 2002; Mezalira et a/., 2005). 
Strategies capable of maintaining average litter 
size while significantly reducing sperm numbers 
are required to advance the utilization of sex- 
sorted sperm and frozen-thawed semen to 
widespread commercial application. 

Finding and using techniques that reduce 
the volume and the number of sperm cells 
required for insemination and pregnancy estab¬ 
lishment while maintaining litter size require¬ 
ments will be beneficial for the exploitation of 
elite boars. Currently, swine producers typically 
utilize two doses of semen, each about 24 h 
apart, and beginning at the onset of oestrus. 
The reason that two doses are used is because 
the length of behavioural oestrus is highly vari¬ 
able between females (24-72 h), and, given that 
the timing of ovulation relative to the duration of 
oestrus is quite variable (Soede and Kemp, 1997; 
Almeida et a I., 2000), it is difficult to establish 
appropriate insemination timing for a single 
dose. Investigators are now developing methods 
to couple ovulation synchronization to currently 
used oestrus synchronization strategies in order 
to limit AI to a single dose. These methods 
include the use of gonadotrophin-releasing hor¬ 
mone agonists (Baer and Bilkei, 2004; Brussow 
et a/., 2007; Martinat-Botte et a/., 2010) and 
luteinizing hormone (Degenstein et a/., 2008). 


Semen and embryo cryopreservation 

Cryopreservation of semen and embryos from 
pigs has presented more of a challenge than in 
many other species. Both semen and embryos 
are sensitive to freeze-thawing. Although boar 


semen has been frozen and used for AI for 
35 years (Fursel and Johnson, 1975; Westendorf 
et a/., 1975), the success rates are still low. 
Boar semen is sensitive to changes in osmotic 
balance, oxidative stress, low temperatures and 
the toxic effects of exposure to cryoprotectants 
(Rath et a/., 2009). There are also significant 
differences between individual boars in the abil¬ 
ity of their sperm to survive cryopreservation 

(Rath et a/., 2009; Roca et a/., 2009). As 

sperm survivability after cryopreservation can 
be low, deep uterine insemination is the pre¬ 
ferred method of AI (Roca et a/., 2009). With 
improvements in procedures for cryopreserva¬ 
tion and AI, the application of other technolo¬ 
gies, such as using gender-sorted semen, might 
become commonplace. 

Similar to sperm, pig embryos are espe¬ 
cially sensitive to a decrease in temperature. 
Interestingly, removal of the numerous lipid 
droplets appears to alleviate this sensitivity 
(Wilmut, 1972; Polge et a/., 1974; Nagashima 
et a/., 1994; Johnson et a/., 2005). However, 
the lipid removal techniques generally compro¬ 
mise the zona pellucida, thus creating the pos¬ 
sibility of pathogen entry. A non-invasive 
method, involving no compromise of the zona 
pellucida, has been developed for high-through¬ 
put cryopreservation of in vitro -produced 
embryos (Li et a/., 2009), and we are currently 
perfecting the technique with in uiuo-produced 
embryos. Other strategies, such as destabilizing 
the cytoskeleton (Dobrinsky et a/., 2000), alter¬ 
ing the vitrification conditions (Berthelot et a/., 
2001; Misumi et a/., 2003; Beebe et a/., 2005; 
Somfai et a/., 2008) or using a solid surface for 
cryopreservation (Somfai et a/., 2008), have 
been successfully employed. Additional research 
is needed to develop these technologies for 
widespread commercial application. 


Embryo transfer 

Utilization of ET in the swine industry is much 
less advanced than the significant progress in 
the beef industry. The greatest advantage of any 
ET programme is that it permits an increase of 
the genetic contributions of select females to the 
herd. While this is a tremendous advantage to 
the beef and dairy industries, where females 
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produce only a single progeny per year, the 
pay-off is not as significant in pigs. Most sows 
are capable of farrowing an average of 2.4 litters 
a year, giving them the ability to transmit their 
genetics to more than 20 progeny a year. 

Perhaps one of the most limiting aspects 
of ET in pigs is that non-surgical collection and 
transfer of embryos is not nearly as successful 
as surgically removing and transferring 
embryos. While investigators have been capa¬ 
ble of producing piglets following non-surgical 
ET (Li et a/., 1996; Cuello et a/., 2005), the 
efficiency still lags significantly behind surgical 
methodologies. Because of the inefficiency of 
non-surgical methods, ET is being used for pri¬ 
marily specialized purposes, such as the trans¬ 
fer of transgenic and/or cloned embryos and, 
for example, determining the developmental 
competency of embryos following in vitro 
manipulations (Li et a/., 2009). 

More recently, pig ET has become some¬ 
what commercialized in an effort to flush embryos 
encapsulated in their zona pellucida from dis¬ 
eased herds, appropriately wash/sanitize them 
(Bureau et a I., 2005) and transfer them to recipi¬ 
ent pigs that have a specific pathogen-free status. 
While this strategy is not economically viable for 
many swine herds, it is being utilized as a method 
for salvaging elite and/or valuable genetic stock 
from herds before depopulation. 


Cloning 

Cloning of pigs also has tremendous potential as 
an assisted reproductive technique. The first 
cloned pigs via SCNT were produced by using 
embryo-derived cells (Prather et a/., 1989) and 
the technique has since been conducted with a 
wide variety of somatic cells (described later). 
The process of SCNT involves the enucleation 
of metaphase Il-arrested oocytes, the reconstruc¬ 
tion of those oocytes by placing a somatic cell in 
the perivitelline space, followed by cell fusion 
and oocyte activation, short term in vitro culture 
and transfer to synchronized surrogates. As will 
be described, cloning is essential to the ability to 
make transgenic pigs, particularly those with a 
gene knock-in or knockout. However, SCNT 
also could have a significant impact on produc¬ 
tion agriculture through the ability to reproduce 


animals carrying very valuable and rare geno¬ 
types, recreate injured, deceased or diseased pigs 
and produce boars from castrated barrows. 


The Utilization of Transgenic Pigs 

A transgenic pig is one whose genetic material 
has been altered to influence the expression 
and abundance of specific genes, with the ulti¬ 
mate objective of producing a specific, desired 
phenotype. This is most commonly done by 
introducing exogenous DNA, produced 
through recombinant DNA technology, into 
the genome of the pig. The appropriate pro¬ 
duction method and expected results can vary 
significantly, depending on whether the desired 
modification is random, targeted and/or tissue 
specific. The ability to produce transgenic pigs 
has dramatically improved over the past 10-15 
years. Here we will describe the uses of trans¬ 
genic pigs, the methods used to produce them 
and the strategies being utilized to improve the 
efficiency of transgenic pig production. 


Current use of transgenic pigs 

While large animals have proven to be very 
functional for our understanding of physiology, 
cell biology, molecular biology and genetics, 
our current ability to make genetic modifica¬ 
tions to pigs has dramatically expanded their 
usefulness. Utilization of transgenic pigs to cre¬ 
ate models of human diseases and pathologies 
is already of value to the scientific community. 

One unique example of the value that 
transgenic livestock offer is the use of their mam¬ 
mary glands for the production of heterologous 
proteins. Surprisingly, in addition to dairy breeds 
of cattle and goats, the pig is also being used as 
a source of heterologous proteins, such as 
human coagulation factors VIII and IX (Faleyanda 
et a/., 1997; Lindsay et a/., 2004) and protein 
C (Van Cott, 1999). While, clearly, pigs are not 
the only species capable of being genetically 
modified to produce a desired protein product in 
their milk, they offer some unique advantages 
compared with other species. The reason these 
genetic modifications have been made in pigs in 
contrast to in a dairy breed of cattle is that pigs are 
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smaller and less expensive to create and main¬ 
tain, have a shorter gestation length, reach 
puberty faster and, being litter bearing, allow for 
a rapid expansion of the transgenic herd. Also of 
potential importance is that post-translational 
modification of transgene protein products is 
thought to occur more accurately in pigs (Van 
Cott et a/., 2004). This could be particularly 
important with regard to the production of pro¬ 
teins such as haemophilic factors that require 
specific post-translational events that may influ¬ 
ence their biological activity and half-life, such as 
the appropriate glycosylation of glycoproteins. 

Valuable disease models in pigs include 
retinitis pigmentosa (Petters et a/., 1997; Ross 
et a/., 2009), cystic fibrosis (Rogers et a/., 
2008a,b), diabetes (Renner et a/., 2008), 
mammary tumours (Yamakawa et a/., 1999) 
and, more recently, Alzheimer’s syndrome 
(Kragh et a/., 2008), to name but a few. 
Genetic modifications to study the incidence of 
pathologies such as cardiovascular disease in 
pigs and in humans have also been made (Lai 
et a/., 2006). In addition to these applications, 
animals have been made that have tetracycline- 
dependent expression (Kues et a/., 2006), or 
express marker genes such as the enhanced 
green fluorescent protein (eGFP) (Park et a/., 
2001), thus enabling a novel method of track¬ 
ing cells transplanted into wild-type animals. 

Transgenic Pig Development 

Numerous milestones have been achieved in the 
development of methods capable of producing 
cloned and transgenic pigs (Table 11.1). The 


development of efficient methods for producing 
transgenic large animals has been and still is 
essential to understanding the biology and 
increasing the utility of pigs. The first transgenic 
animal created was a mouse, produced nearly 
30 years ago (Gordon et a/., 1980; Brinster 
et a/., 1981; Costantini and Lacy, 1981; 
Wagner et a/., 1981). A few years later, gene 
‘knockout’ mice were created by utilization of 
homologous recombination in embryonic stem 
(ES) cell lines (Doetschman et ai, 1987; Thomas 
and Capecchi, 1987). Transmission of these 
methodologies to pigs is rapidly progressing, 
although not without some difficulty. Several pri¬ 
mary methods have been used to create geneti¬ 
cally modified pigs (Table 11.2), each of which 
can have specific advantages and limitations 
depending on the desired outcome. Below we 
will briefly describe the primary methods that 
have produced the majority of transgenic pigs. 


Pronuclear injection 

The first transgenic animals created were mice 
produced during the early 1980s through pro- 
nuclear injection of DNA in the one-cell stage 
zygote (Gordon et a/., 1980; Brinster et a/., 
1981; Costantini and Lacy, 1981; Wagner 
et ai, 1981). The process involves the injec¬ 
tion of linearized DNA into the pronuclei of a 
single-cell zygote, and is repeatable and cap¬ 
able of producing offspring with a randomly 
integrated transgene. The utilization of the 
technique developed in rodents followed suit in 
large animal species as well, and has resulted in 
the production of transgenic cattle (Bondioli 


Table 11.1. A timeline of some of the significant achievements leading to the ability to produce 
transgenic pigs. 

Milestone Reference 


Transgenic pig via pronuclear injection 

Cloned pig from embryonic cells 

Sperm mediated gene transfer in pigs 

Cloned pig from fetal and adult cells 

Transgenic pig via oocyte transduction 

Transgenic pig via somatic cell nuclear transfer (SCNT) 

Gene ‘knockout’ pig using SCNT 

Intracytoplasmic sperm injection (ICSI)-mediated transgenic pig 
Gene ‘knock-in’ pig via SCNT 


Hammer eta/., 1985 
Prather etal., 1989 
Lavitrano etal., 1997 
Polejaeva etal., 2000 
Cabot et ai, 2001 
Park et al., 2001 
Lai etal., 2002b 
Kurome et ai, 2007 
Rogers etal., 2008a 
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Table 11.2. Primary methods used to create genetically modified pigs. 

Method Advantages/limitations 

Pronuclear injection No control over transgene insertion site 

Oocyte transduction Potential for mosaic transgene integration 

Intracytoplasmic sperm injection Low percentage of offspring are transgenic 

(ICSI)-mediated gene transfer 

Genetic modification of somatic The only established method for transgenic knock-in or knockout 

cell followed by nuclear transfer Longer time to create transgenic cell lines 

Potential complications of large offspring syndrome 


et a/., 1991), pigs (Hammer et a/., 1985; Vize 
et a/., 1988; Petters et a/., 1997; Bleck et a/., 
1998) and goats (Wang et a/., 2002). 


Sperm- and intracytoplasmic sperm 
injection (ICSI)-mediated transgenesis 

Another strategy for producing transgenic ani¬ 
mals is by using sperm for transgene delivery 
(Lavitrano et a/., 1989). Sperm-mediated 
transgenesis can occur by AI or by intracyto¬ 
plasmic sperm injection (ICSI), with sperm car¬ 
rying the transgene concerned. ICSI-mediated 
transgenesis occurs following the chemical or 
mechanical disruption of the sperm membrane, 
a short incubation with double-stranded DNA, 
injection into a metaphase Il-arrested oocyte 
and subsequent activation (Perry et a/., 1999; 
Kurome et a/., 2006; Pereyra-Bonnet et a/., 
2008). The procedure overcomes the tedious 
task of introducing the DNA into the pronuclei 
by utilizing the fractured membrane of sperm 
to carry exogenous DNA into the oocyte. In 
pigs, ICSI-mediated delivery of a transgene 
produced seven fetuses from 219 oocytes fer¬ 
tilized via ICSI-mediated gene transfer, of which 
two were transgenic (Kurome et a/., 2007). 
Sperm-mediated transgene delivery has been 
utilized for the production of human decay 
accelerating factor in transgenic pigs (Lavitrano 
et a/., 1997) and appears to be somewhat effi¬ 
cient in incorporating multiple transgenes 
simultaneously (Webster et a/., 2005). 

Oocyte transduction 

Genetic modification of germ cells has to date 
only been achieved in the oocyte. For example, 


a replication-defective retrovirus can be used to 
infect an unfertilized oocyte. These vectors inte¬ 
grate into metaphase chromosomes. Because 
the oocyte is arrested in metaphase II of meio- 
sis, this is an excellent strategy for obtaining 
transgene integration. The process was first 
demonstrated in cattle by injecting the vector 
under the zona pellucida, permitting infection 
and integration, fertilizing the oocyte, culturing 
to the blastocyst stage, and then ET (Chan 
et a/., 1998). A similar strategy was used in 
pigs to create the first eGFP-transgenic pigs 
(Cabot et a/., 2001). 


Genetic modification of a somatic 
cell followed by nuclear transfer 

SCNT involves the enucleation of a met¬ 
aphase Il-arrested oocyte and replacement of 
the original nucleus with the nucleus from a 
somatic cell, followed by activation of the 
reconstructed embryo. Somatic cell cloning 
has been essential for use in domestic live¬ 
stock species because of the inability to isolate 
true embryonic stem cells in these species. 
The first domestic animal cloned from adult 
somatic cells was Dolly the sheep, born in 
1996; the cloning used mammary gland cells 
for donor nuclei (Wilmut et a/., 1997). Since 
this original demonstration that the mammalian 
oocyte is capable of remodelling and repro¬ 
gramming a differentiated nucleus, somatic 
cell cloning of other large animal species 
(Cibelli et a/., 1998; Baguisi et a/., 1999; 
Galli et a/., 2003; Woods et a/., 2003; Jang 
et a/., 2007; Oh et a/., 2008; Yin et a/., 
2008), including pigs (Polejaeva et a/., 2000), 
has occurred rapidly. While large animals had 
been previously cloned (Prather et a/., 1987, 
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1989; Sims and First, 1994) using embryo- 
derived cells, cloning with differentiated somatic 
cells presented the opportunity to identify 
somatic cell types that can be produced in 
greater quantities and are easier to genetically 
modify. Making genetic modifications to the 
somatic cell before nuclear transfer has enabled 
the production of numerous transgenic pigs. 

While pronuclear injection, ICSI-mediated 
transgenesis, sperm-mediated gene transfer 
and oocyte transduction are each capable of 
producing transgenic pigs, SCNT following 
genetic modification to the donor nuclei has 
become the preferred method for a variety of 
reasons. SCNT is advantageous over oocyte 
transduction, pronuclear injection and ICSI- 
mediated transgenesis as all offspring produced 
through SCNT are expected to be transgenic, 
and the modifications are always expected to 
demonstrate germline transmission. In addi¬ 
tion, SCNT using clonal transgenic cell lines 
allows the investigator to preliminarily charac¬ 
terize the modification by addressing the 
number of integration sites, the transgene copy 
number and the chromosomal location before 
creating piglets. Importantly, because homolo¬ 
gous recombination occurs at such a low fre¬ 
quency, the process of knocking in or out a 
gene has only been accomplished in pigs 
through SCNT using appropriately modified 
somatic cells. 


Genetic Modification of Somatic Cells 

for Nuclear Transfer 

Genetically modifying a somatic cell followed 
by its use for nuclear transfer has been effect¬ 
ively used to generate cloned transgenic pigs 
(Lai et a/., 2006; Rogers et a/., 2008a). The 
process from designing the construct to char¬ 
acterizing the transgenic offspring requires 
consideration of numerous factors, some of 
which are presented in Fig. 11.1, to produce 
the desired outcome. The primary methods of 
gene targeting (i.e. gene knockout) and gene 
addition (i.e. gene knock-in) are improving in 
terms of both efficiency and specificity. Gene- 
knockout animals were first demonstrated by 
modification of the genome using homologous 
recombination (Doetschman et a/., 1987; 


WA 


Thomas and Capecchi, 1987). Because homol¬ 
ogous recombination is such a rare event, it is 
not an effective strategy for genome manipula¬ 
tion during the initial stages of embryonic 
development - the point at which transgenic 
modification occurs following pronuclear injec¬ 
tion or ICSI-mediated transgenesis. A major 
advantage of using mice for transgenesis by 
homologous recombination was the availability 
of ES cells that can grow indefinitely and are 
capable of being introduced into early-stage 
embryos and contributing to chimeric off¬ 
spring. Significant efforts have been invested 
into the creation of ES cell lines in large domes¬ 
ticated animals, although these have been 
largely unsuccessful (Talbot and Blomberg, 
2008). 


Exogenous DNA delivery into the 
potential donor cell 

Efficient generation of genetic modifications 
in donor cell genomes relies not only on the 
targeting/integration strategy but also on the 
efficient introduction of the exogenous DNA 
into the potential donor cell. To this end, a 
variety of methods have been quite effective, 
including lipid-based delivery (Flyun et a/., 
2003; Lee et a/., 2005), viral delivery (Lai 
et a/., 2002a; Rogers et a/., 2008a) and 
electroporation (Arat et a/., 2001; 

Ramsoondar et a/., 2003; Watanabe et a/., 
2005; Ross et at., 2010). It has more recently 
been demonstrated that utilization of the 
Amaxa Nucleofection System™ was capable 
of introducing exogenous DNA into 79% of 
surviving cells as determined by eGFP expres¬ 
sion, compared with 53% of surviving cells 
using electroporation (Nakayama et a/., 
2007). We were able to achieve similarly high 
transient transfection efficiency through elec¬ 
troporation by using repeated square-wave 
pulses. Using optimized electroporation con¬ 
ditions, more than 80% of stably transgenic 
colonies were PCR positive for the transgene 
when the copy number ratio of selectable 
marker to transgene was 1:1 (Ross et a/., 
2010). Because of this frequency, transgen¬ 
esis via randomly integrated gene addition is 
relatively feasible. 
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Design of the DNA construct 


Targeting vector 
Promoter 

(endogenous versus exogenous) 

(tissue specific versus constitutive versus ubiquitous) 

Coding sequence 
Include untranslated regions? 

Poly(A)tail (mRNA stability) 

Selectable marker (attached to vector versus co-delivery) 
Positive selection (i.e. neomycin resistance) 

Negative selection (i.e. thymidine kinase) 

Other methods for targeted modification (i.e. promoter trapping) 




Delivery of the DNA construct 



Electroporation 
Lipid-mediated transfection 
Viral delivery 




Cell selection 


> 


Selectable marker (i.e. geneticin for neomycin resistance) 
Negative selection (ganciclovir for thymidine kinase) 
Fluorescence-activated cell sorting 




Somatic cell nuclear transfer 



In vitro maturation of metaphase ll-arrested ooctyes 
Enucleation and reconstruction with genetically modified somatic cell 
Cell fusion and egg activation 
In vitro culture (hours to days) 




Embryo transfer and gestation 


Synchronization of surrogates 
Surgical embryo transfer 
Gestation 

Caesarean versus natural birth 


y 


Genotyping offspring 


Screening of offspring for transgene: 

Polymerase chain reaction (PCR) 
Southern blotting 
Fluorescent in situ hybridization 




Characterization of founders 


Transgene expression analysis: 

mRNA transgene abundance 
Western blotting 
Phenotype characterization 


Fig. 11.1. General workflow and some considerations for creating genetically modified pigs through 
somatic cell nuclear transfer (SCNT). 


Random integration 

For successful production of transgenic pigs car¬ 
rying a gene addition, exogenous DNA must be 
delivered to the nucleus and, following a double- 
stranded DNA break, be integrated into the host 
genome. While gene knockout in pigs has only 
occurred through somatic cell modification 
followed by SCNT, random integration of an 
exogenous transgene can occur through any of 


the above-discussed strategies for producing a 
transgenic pig. Transgene constructs used for gene 
additions are commonly represented by coupling 
the coding sequence of a gene to a specific 
promoter to produce a protein product expressed 
in a tissue-specific (Faleyanda et a/., 1997) or a 
ubiquitous manner (Whitworth et a/., 2009). 

One of the greatest difficulties with respect 
to integrating a transgene into the genome of a 
mammal is producing sufficient offspring to 
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screen and identify those with the most appro¬ 
priate phenotype. The reason for this is the mul¬ 
tiplicity of events that can vary between each 
founder; the precise site into which the trans¬ 
gene integrates, the number of integration sites 
and the number of transgene copies in each inte¬ 
gration site can all vary between founders. Most 
integration events in mammalian cells following 
exogenous DNA delivery via electroporation 
result in only a single integration site (Nakanishi 
et ai, 2002). However, the number of transgene 
copies within the integration site and the ability 
of the transgene to be expressed from different 
loci vary considerably. An example of this is the 
recent production of a new miniature pig model 
of retinitis pigmentosa (Ross et al., 2009). Six 
male founder piglets were created by adding the 
P23H human rhodopsin gene into the pig 
genome. While each founder is PCR positive for 
the transgene, fluorescent in situ hybridization 
indicated that transgene integration occurred on 
a variety of chromosomes and that the progres¬ 
sion of retinal degeneration as determined 
through electroretinography varied significantly 
between founders. 

Targeted modifications 

The established method of gene targeting by 
homologous recombination, established in 
mouse ES cells (Thomas and Capecchi, 1987; 
Doetschman et a/., 1987), remains the primary 
method by which targeted genetic modifications 
in other mammals have been accomplished. 
A targeting vector possessing a desired trait 
mutation is introduced into somatic cells (e.g. 
fetal fibroblasts) followed by colony selection 
and screening to identify appropriately modified 
cell lines for SCNT. Most genomic modifications 
will be the result of random incorporation of the 
targeting vector into the genomic DNA (Iiizumi 
et a/., 2008). However, in rare events, the arms 
of the targeting vector match up with homolo¬ 
gous chromosomal DNA, and the exogenous 
DNA (situated between the arms of homology 
in the targeting vector) will be inserted into 
the genome by homologous recombination 
(Smithies et a/., 1985). Positive and negative 
selection facilitates the identification of trans¬ 
genic somatic cells potentially having the desired 
targeting event. 


Another potentially useful method of 
introducing targeted modifications are zinc fin¬ 
ger nucleases (ZFNs). ZFNs introduce DNA 
double-strand breaks at specific DNA sequences 
and have the capacity to increase homologous 
recombination-mediated gene targeting fre¬ 
quency by nearly 1000-fold (Porteus and 
Carroll, 2005; Morton et ai, 2006; Moehle 
et a/., 2007). ZFNs comprise a non-specific 
FokI (restriction endonuclease) cleavage domain 
and three to four Cys 2 His 2 zinc finger (ZF) 
DNA-binding domains (Tan et a/., 2003). Gene 
disruption and repair (i.e. gene therapy) can be 
achieved with ZFNs in human and 
Caenorhabditis elegans somatic cells (Urnov 
et ai, 2005; Morton et ai , 2006). While ZFNs 
have significant potential in terms of target 
specificity, designing and delivering functional 
ZFNs can be quite challenging (Ramirez et ai, 
2008). Efforts are currently under way to 
design and test ZFNs to target genes in swine 
fibroblasts, with intentions to develop trans¬ 
genic swine via SCNT (Whyte et ai , 2008). 

Other strategies, including the use of recom¬ 
binant adeno-associated virus (AAV) vectors, 
were used to deliver genetic constructs that were 
effective for gene-specific targeting in pig fetal 
fibroblasts (Rogers et aI., 2008a). Not only was 
the vector highly efficient in the production of 
cystic fibrosis transmembrane regulator (CFTR)- 
targeted fetal fibroblasts, but these authors also 
demonstrated that individual cell lines have very 
different targeting efficiencies. For example, out 
of six cell lines used for targeting the CFTR 
gene, the frequency of targeted clonal cell lines 
as a percentage of transgenic lines ranged from 
0.07 to 10.93% (Rogers et al. , 2008a). 

Conditional/inducible genetic engineering 

Incomplete understanding of all specific gene 
functions can result in embryonic lethality in 
developing embryos possessing a trait knockout, 
as has been demonstrated in the mouse (Tsumura 
et al. , 2003). Overcoming the embryonic 
lethality of specific gene knockouts underscores 
the importance of developing targeting strate¬ 
gies that can be conditionally induced. Conditional 
mutation by site-specific recombination has been 
reviewed in great detail elsewhere (Gossen and 
Bujard, 2002; Glaser et a/., 2005). Essentially, a 
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site-specific recombinase, commonly Cre, is used 
to conditionally control gene expression. Cre 
recombinase catalyses a DNA recombination event 
removing DNA between two loxP sites (34-bp rec¬ 
ognition sites), each composed of two 13-bp 
inverted repeats flanking an 8-bp central element 
(Primrose and Twyman, 2006). Cre has utility in 
excising unwanted selectable markers, such as the 
neomycin resistance gene (neo) in transgenic con¬ 
structs (Kaartinen and Nagy, 2001). The basic 
premise is to design targeting vectors with part of 
the preferred endogenous gene flanked on each 
side by loxP sites, also termed ‘floxed’ (Sauer, 
1998). Insertion of the loxP sites into introns flank¬ 
ing an exon critical for protein function allows nor¬ 
mal expression to continue until Cre recombinase 
is present, driven by temporal or tissue-specific 
promoters (Primrose and Twyman, 2006). Because 
Cre recombinase does not require accessory pro¬ 
teins or cofactors for its activity, it is suitable for use 
in mammalian cells (Glaser et a/., 2005). 

One example of this is the conditional 
knockout of Dicer , which codes for a critical 
enzyme in miRNA (micro RNA) biogenesis dur¬ 
ing oocyte development (Tang et a/., 2007). This 
was accomplished by mating a transgenic mouse 
containing the floxed Dicer allele (Yi et a/., 2006) 
with a mouse transgenic with the ZP3 promoter 
driving the expression of Cre recombinase (de 
Vries et a/., 2000). Because ZP3 is an oocyte- 
specific promoter, the result was the ability to 
determine specific temporal and spatial impacts 
of miRNA biogenesis during oocyte growth. 
Modifications such as these will be essential in 
further expanding the pig as a research model 
for human health and physiology. 

Type of donor cell affects success 
of somatic cell nuclear transfer (SCNT) 

Despite significant progress in creating clones 
of large animals without the utilization of ES 
cells, the process is still inefficient. Because the 
efficiency largely depends on the extent of 
nuclear remodelling and reprogramming, 
numerous cell types have been tested for their 
ability to produce clones through SCNT. A vari¬ 
ety of cells has been utilized for SCNT in pigs, 
such as pre-adipocytes (Tomii et a/., 2005), sali¬ 
vary gland-derived progenitor cells (Kurome 
et a/., 2008), fetal somatic stem cells (Hornen 


et a/., 2007), skin-derived stem cells (Hao et a/., 
2009), adult fibroblasts (Beebe et a/., 2007; 
Brunetti et a/., 2008) and fetal fibroblasts (Park 
et a/., 2001; Lai et a/., 2002a,b; Hyun et a/., 
2003). Fetal fibroblasts have proven valuable for 
the production of transgenic pigs because they 
are easily collected and cultured, are capable of 
being genetically modified and possess the abil¬ 
ity to produce live offspring. While fetal fibro¬ 
blasts, like other somatic cells, are capable of 
producing live offspring, the efficiency at which 
nuclear remodelling and reprogramming are 
sufficient to produce live offspring remains low. 
It is hypothesized that many of the differences in 
developmental efficiency between donor cells 
following SCNT may be a result of genomic 
methylation differences (Bonk et al ., 2007). 


Improving somatic cell remodelling 
and reprogramming 

During normal, in vivo reproduction, shortly 
after fertilization, both the male and female 
pronuclei undergo global demethylation, which 
contributes to the production of a totipotent 
zygote. The zygote then undergoes several 
rounds of holoblastic cleavage and the activa¬ 
tion of the embryonic genome at around the 
four- to eight-cell stage of development in the 
pig. As the embryo continues to develop, 
the methylation patterns and chromatin struc¬ 
ture (i.e. histone code) begin to differentiate 
between cells, enabling the regulation of their 
transcriptomes and proteomes to contribute to 
the production of multiple cell types by the 
blastocyst stage of development. During SCNT, 
the oocyte must remodel (i.e. change the struc¬ 
ture of the chromatin) and reprogramme (i.e. 
alter the transcriptional profile) the somatic cell 
genome to enable the appropriate spatial and 
temporal gene expression required for success¬ 
ful embryonic and fetal development. 

The mechanisms involved during SCNT 
are complex and poorly understood, and 
require cytoplast-mediated reprogramming of 
the donor nucleus from a differentiated state to 
a totipotent condition (Mitalipov et a/., 2007). 
Remodelling and reprogramming of differenti¬ 
ated somatic nuclei into a totipotent embryonic 
state by SCNT are not efficient, and the 
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mechanism by which this remodelling occurs is 
not well understood. Many epigenetic modifi¬ 
cations occurring during normal embryonic 
development are likely required by the somatic 
cell genome for SCNT to be successful. 

Accruing evidence suggests that inefficien¬ 
cies in epigenetic reprogramming in SCNT 
embryos are the result of errors in DNA methyla- 
tion and chromatin remodelling, which may con¬ 
tribute to the poor survival of SCNT-derived 
embryos (Dean et aL, 2001; Kang et a/., 2001; 
Santos et a/., 2003; Ohgane et a/., 2004; Bonk 
et a/., 2007). This is supported by the observa¬ 
tion of molecular abnormalities in the placenta 
and tissues of live-cloned animals (Humpherys 
et a/., 2002; Inoue et a/., 2002; Jiang et aL, 
2008). One potential contributor to the epigen¬ 
etic reprogramming of SCNT-derived embryos 
is histone modification following SCNT. The his¬ 
tone code is a cellular memory responsible for 
maintaining the identity of differentiated cells 
(Bird, 2002) and correlates with gene activation 
through its effect on chromatin accessibility 
(Th’ng, 2001). Treatment of pig SCNT embryos 
with trichostatin A (TSA), a histone deacetylase 
inhibitor (HDACi), resulted in higher in vitro 
embryonic development than in controls (Zhang 
et a/., 2007; Li et a/., 2008). A downside to 
TSA is that it can be teratogenic when the con¬ 
centration is high or the exposure is long 

(Svensson et a/., 1998). Scriptaid, a HDACi 
with low toxicity that enhances transcriptional 
activity and protein expression (Su et a/., 2000), 
significantly enhanced the development of SCNT 
embryos to the blastocyst stage when outbred 
Landrace fetal fibroblast cells (FFCs) or NIH mini 
pig inbred FFCs were used as donors, compared 
with results from an untreated group (Zhao 
et a/., 2009, 2010). Considering that the repro¬ 
gramming of nuclei following nuclear transfer 
only happens during a limited time before zygote 
genome activation, the relaxation of chromatin 
structure by histone acetylation, which corre¬ 
sponds to a transcriptionally permissive state, 
might contribute to successful cloning. 

In addition to promoting the efficiency and 
capacity of the oocyte to appropriately remodel 
and reprogramme the somatic cell genome, the 
identification of somatic cells that are capable of 
being genetically modified and have a pluripo- 
tent phenotype, such as ES cells in mice, may 
improve the efficiency of SCNT to make trans¬ 
genic pigs. While true ES cells have been difficult 


to obtain in pigs, a renewed interest in the devel¬ 
opment of induced pluripotent stem cells has 
recently developed (Esteban et aL, 2009; Ezashi 
et aL, 2009; Roberts et aL, 2009). 

Future of Transgenic Pigs 

The increased efficiency of SCNT, coupled with 
the rapidly increasing understanding of the bio¬ 
logical functions of encoded gene products, will 
enable the production of transgenic pigs that 
can accurately address issues affecting pig pro¬ 
duction and the use of the pig as a biomedical 
model. While the technologies have advanced 
to the degree where the types of genetic manip¬ 
ulations are becoming unlimited, some poten¬ 
tial areas that could be significantly affected by 
genetically modified pigs are discussed below. 

Reproductive efficiency 

While litter size in pigs has increased over the 
past few decades, pigs still offer significant poten¬ 
tial to increase the number and size of piglets 
weaned. One strategy to improve milk produc¬ 
tion was the creation of transgenic sows over¬ 
expressing bovine alpha-lactalbumin in their milk 
(Bleck et aL, 1998). Piglets raised by the trans¬ 
genic sows over-expressing bovine alpha- 
lactalbumin demonstrated an increased milk 
production during the first 9 days of lactation and 
improved litter weight gain (Noble et aL, 2002). 

It was also thought that ovulation rate may 
be increased in transgenic pigs that express the 
human beta-cell leukaemia/lymphoma-2 (BCL2) 
gene in follicles by preventing or reducing apop¬ 
tosis (Guthrie et aL, 2005). Unfortunately, an 
improvement in reproductive function was not 
observed, but the study underscores the useful¬ 
ness of expressing a transgene to better under¬ 
stand biological systems for affecting reproductive 
efficiency in pigs. 


Growth and development 

A variety of transgenic modifications have been 
made in pigs to enhance and/or study the 
mechanisms of growth and development. 
These include pigs transgenic for the mouse 
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metallothionein-1 promoter driving expression 
of human or bovine growth hormone, human 
growth hormone releasing factor or human 
insulin-like growth factor-I (Fursel et a/., 1990, 
1999). 

As the world population and demand for 
animal protein continue to increase, the 
demand for animals capable of more efficiently 
converting cereal grains into lean tissue will 
increase. Discovery and understanding of the 
molecular controls of nutrient transport and 
utilization will enable the manipulation of 
genomes to enhance or diminish the activity of 
specific proteins that contribute to the efficient 
utilization of feed. 


Disease resistance 

Infectious diseases cause a significant negative 
economic impact on production agriculture. 
Either identification of animals that are natu¬ 
rally resistant to disease, as with brucellosis in 
cattle, and cloning those animals (Westhusin 
et a/., 2007), or modifying the animal so that it 
is resistant to disease, as with prions (Golding 
et a/., 2006), should result in a population that 
is more resistant to disease. Many infectious 
agents attach to cell-surface molecules and are 
then internalized and initiate infection of the 
cell. Modification of cell-surface molecules has 
already been achieved in pigs (Lai et a/., 2002b; 
Kolber-Simonds et a/., 2004), and this results 
in a change in the antigenicity of the cell sur¬ 
face (Kuwaki et a/., 2005). Similar modification 
to other cell surface molecules may render the 
cells/pigs resistant to infection by a variety of 
infectious agents. 


Transgenic pigs for xenotransplantation 

Because pigs have anatomy similar to humans 
they represent potential models as a species 
amenable for the harvest of organs for 
xenotransplantation. In fact, this area of 
research has gained significant interest in the 
past decade after the ability to make genetic 
modifications to the pig cells used for SCNT. 

Pigs have long been considered a source of 
organs for xenotransplantation to humans. Their 
anatomy, physiology and genetics are similar to 


those of humans (Prather et a/., 2008). An initial 
problem with xenotransplantation is the hyper¬ 
acute rejection (HAR) that occurs following 
transplantation from pigs to primates, as a result 
of natural antibodies that are capable of initiating 
complement activation and eliciting a cascade of 
events that end in immediate rejection of the 
organ (Dalmasso et ai, 1992). 

A large number of genetic modifications 
have been made to make the pig organs less 
immunogenic to the primate. These include 
making modifications to eliminate HAR, such 
as adding complement modifiers (Langford 
et a/., 1994; Diamond et a/., 1996, 2001; 
Zhou et a/., 2005), remodelling the cell surface 
carbohydrates (Bondioli eta/., 2001; Miyagawa 
et a/., 2001), preventing antithrombosis 

(Petersen et a/., 2008) and knocking out alpha- 
1,3-galactosyltransf erase (Dai et a/., 2002; Lai 
et a/., 2002b; Ramsoondar et ai, 2003). 
Additional modifications are now focusing on 
post HAR (Tu et a/., 2003; Klose et a/., 
2005). 

As progress continues in the production 
of a transgenic pig for which the organs are 
capable of being utilized for transplantation 
into humans, other issues will also need to be 
addressed. One of these is the matter of 
porcine endogenous retroviruses (PERVs) 
(Dieckhoff et a/., 2008). While their impact 
on xenotransplantation is unclear, certain 
types of PERVs have been documented to be 
transmissible across species in vitro (Patience 
et al ., 1997; Wilson et ai, 1998). Because of 
their multiplicity throughout the pig genome, 
their entire removal by ‘knocking out’ each 
retrovirus through transgenesis or by develop¬ 
ing a breeding strategy that is capable of pro¬ 
ducing pigs lacking all known retroviruses 
both represent daunting tasks. Instead, some 
efforts have focused on the development of 
short interfering RNA (siRNA) constructs 
whose expression can effectively knock 
down PERV expression in pigs (Ramsoondar 
et a/., 2009). 

Transgenic pigs for biomedical models 

While a tremendous amount of basic biological 
information has been learned from transgenic 
mice, an incredible amount is also being gained 
from the pig (Lunney, 2007). In some cases, 
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modifications to the pig genome to model human 
diseases have been shown to produce a pheno¬ 
type more similar to human conditions than 
other existing models (Rogers et a/., 2008b). 
The larger size of the pig makes it more expen¬ 
sive to maintain as a research model relative to 
rodents; however, the size can be used advanta¬ 
geously for issues such as determining drug dos¬ 
age in preclinical trials and developing therapeutic 
strategies requiring surgical intervention. 
Transgenic pigs will continue to be made to 
model human genetic diseases, such as muscular 
dystrophy, sickle cell anaemia and many others, 
including genetic anomalies that cause a predis¬ 
position to specific cancers such as BRCA1 and 
BRCA2 mutations in human breast cancer. 


Conclusions 

As basic scientific discovery exposes specific 
molecular mechanisms that can be controlled 
or influenced by the modification of specific 
genetic sequence(s), the opportunities to 
improve animal production agriculture and bio¬ 
medicine using the pig become unlimited. While 
there is no doubt that scientists and producers 
will continue to improve pig production effi¬ 
ciency through better management practices 
that utilize novel tools in reproduction, the abil¬ 
ity to make transgenic modifications to the pig 
has given us tremendous opportunities to better 
understand physiology and improve the health 
and well-being of both animals and humans. 
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Plate 1. (A) Segregation of coat colour among F 2 animals from an intercross between European wild boar and 
Large White domestic pigs. Two alleles are segregating at the Extension/MCIR locus: the wild-type allele (E + ) 
and the recessive allele for black spotting (E p ). Three alleles are segregating at the Dominant white/KIT locus: the 
recessive wild-type allele (/) and the alleles for Patch (l p ) and Dominant white colour (/). The genotypes of the 
depicted animals are, from left to right: 1: ///', E P IE P \ 2: i/i, E p /E p ; 3: Hi, E + I -; 4: l p /i, E + !-\ 5: //-, -/-. Photo: Mats 
Gerentz, Swedish University of Agricultural Sciences. (B) Median-joining network tree of MC1R alleles in wild and 
domestic pigs from Europe and Asia. Alleles are represented by circles and a four-digit allele designation (see 
Table 3.3), while squares represent predicted intermediate forms that have not yet been found. Thin black lines 
perpendicular to lines connecting alleles represent synonymous changes, while thicker red lines represent 
non-synonymous changes. Colours inside circles and squares represent observed and predicted phenotypes, 
respectively; question marks inside two of the squares indicate that the associated phenotypes cannot be 
predicted because they are intermediate forms between alleles differing by two non-synonymous substitutions. 
The asterisks associated with the synonymous substitutions leading to alleles 0203 and 0104 indicate the only 
instance of an identical mutation at different locations in the tree. Based on Fang et al. (2009). Publication of this 
figure was generously supported by PIC (part of Genus pic), Hendersonville, Tennessee, USA. 
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Plate 2. One of the first comparative Zoo-FISH maps of the pig and human genomes. An index of colour patterns 
represents individual human chromosomes. The homologies between the human and porcine karyotypes are 
demarcated by these colour patterns on the pig chromosomes. Loci mapped to individual pig chromosomes at 
that time are arranged on the right side of each chromosome. Courtesy of Fronicke et al. (1996) (colour). 
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Plate 3. Porcine-human comparative map. The comparative map was based on human genome build 37 and porcine genome build 9, which represents 89% of the porcine 
genome. Colour codes refer to the 23 human chromosomes. 
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Plate 4. Chester White, male, USA. Photo: Mapes Livestock Photos, Milford Center, Ohio, USA. 
Plate 5. Duroc, female, USA. Photo: National Swine Registry, USA. 

Plate 6. Hampshire, female, USA. Photo: National Swine Registry, USA. 

Plate 7. Hereford, female, USA. Photo: Mapes Livestock Photos, Milford Center, Ohio, USA. 
Plate 8. Poland China, female, USA. Photo: Mapes Livestock Photos, Milford Center, Ohio, USA. 
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Plate 9. Spotted, female, USA. Photo: Mapes Livestock Photos, Milford Center, Ohio, USA. 

Plate 10. Yorkshire, female, USA. Photo: National Swine Registry, USA. 

Plate 11. Pampa-Rocha, female, Uruguay. Photo: Washington Bell, Universidad de la Republica, Uruguay. 
Plate 12. Piau, female, Brazil. Photo: Simone Guimaraes, Universidade Federal de Vigosa, Brazil. 

Plate 13. Berkshire, male, England. Photo: Mapes Livestock Photos, Milford Center, Ohio, USA. 
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Plate 14. Gloucestershire Old Spot, male, England. Photo: Palmer Holden, Iowa State University, USA. 
Plate 15. Middle White, female, England. Photo: David Merrett, Creative Commons, Daventry, UK. 
Plate 16. Tamworth, female, England. Photo: Mapes Livestock Photos, Milford Center, Ohio, USA. 
Plate 17. Angeln Saddleback, female, Germany. Photo: Hedwig von Ebbel, Wikimedia, Germany. 

Plate 18. Bentheim Black Pied, female, Germany. Photo: Garitzko, Wikimedia, Germany. 
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Plate 19. Landrace, female, Denmark. Photo: National Swine Registry, USA. 

Plate 20. Mangalitsa, female, Romania. Photo: Daniel Ciobanu, University of Nebraska, USA. 

Plate 21. Pietrain, male, Belgium. Photo: Jan Bielfeldt, Schweineherdbuchzucht Schleswig-Holstein E.V., Germany. 
Plate 22. Da Min, male, China. Photo: Max Rothschild, Iowa State University, USA. 

Plate 23. Fengjing, male, China. Photo: Max Rothschild, Iowa State University, USA. 
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Plate 24. Luchuan, female, China. Photo: Kenneth Stalder, Iowa State University, USA. 

Plate 25. Meishan, female, China. Photo: Max Rothschild, Iowa State University, USA. 

Plate 26. Mong Cai, male, Vietnam. Photo: Kenneth Stalder, Iowa State University, USA. 

Plate 27. Tongcheng, female, China. Photo: San-Ping Xu and Bin Fan, Huazhong Agricultural University, China. 
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Introduction 

The study of mammalian development has 
historically relied on morphological observa¬ 
tions and the evaluation of developing offspring 
in utero. This painstaking work has resulted in 
detailed and meticulous descriptions of the 
events and structures that arise during this ele¬ 
gantly orchestrated process (Fatten, 1948). 
The developmental genetics of the pig, as with 
many other mammals, was hindered for a long 
time owing to the tremendous complexity of 
development and the lack of appropriate meth¬ 
ods. However, recent advances in molecular 
techniques and gene expression profiling have 
resulted in the generation of a vast amount of 
data concerning the genetics of mammalian 
development. A great deal of this work has 
been carried out in the mouse, but there are 
increasing amounts of information on the 
embryology and genetics of development in 
economically important agricultural species 
such as the pig. Drawing on this information, 
this chapter is intended to provide an overview 
of the current understanding of mammalian 
development as it relates to pigs. 

Developmental Stages 
of the Pig Embryo 

Gametogenesis 

As in any mammal, the beginning of develop¬ 
ment in the pig, as well as the origin of its genetic 
material, lies in the gametes that come together 
to ultimately create a porcine conceptus. As a 
result, the development and developmental com¬ 
petence of the conceptus can be affected by 
gametogenesis. 

The ovary of newly born females contains 
a lifetime supply of oocytes stored in quiescent 
primordial follicles (for a full review see Hunter, 
2000; Picton et a/., 2008; Edson et a/., 2009). 
With oogenesis commencing in the female 
fetus, meiosis of oogonia begins as early as 
40 days after conception and, about 35 days 
after birth, all oocytes are formed and arrested 
at the prophase of the first meiotic division (i.e. 
the germinal vesicle stage). At puberty, and 
continuing throughout the reproductive life of 


the female, under appropriate hormonal con¬ 
trol, pools of primordial follicles are recruited 
to grow. In the pig, an estimated 500,000 
primordial follicles are present at birth and this 
number decreases slightly to approximately 
400,000 around puberty. Follicular develop¬ 
ment follows a well-coordinated series of events 
characterized by follicular and oocyte growth, 
as well as cell proliferation and differentiation. 
Morbeck et al. (1992) were able to determine 
the time needed for porcine follicular develop¬ 
ment from the initiation of primordial follicle 
growth to the pre-ovulatory stage; a primordial 
follicle is estimated to require approximately 
84 days before reaching this stage (Fig. 12.1). 
Another 14 days is then necessary for an early 
antral follicle to develop to a 3-mm follicle. 
Finally, based on an estimated growth rate of 
lmm/day for follicles larger than 3 mm, a 
3-mm follicle requires 5 days to reach pre¬ 
ovulatory status. Therefore, roughly 100 days 
are needed for a porcine primordial follicle to 
ovulate. Oocytes also grow throughout follicu¬ 
lar development, starting at less than 30 pm in 
primordial follicles and reaching an average of 
120 pm in the pre-ovulatory follicle. Oocyte 
growth is almost linear until the follicle reaches 
300 pm in diameter, after which the size of the 
oocyte remains relatively constant until ovula¬ 
tion. Oocyte growth is also associated with 
meiotic and developmental competence, as 
well as with the ability of the oocyte to undergo 
meiotic maturation, fertilization and develop¬ 
ment. In vivo , a surge of luteinizing hormone 
(LH) initiates oocyte maturation and induces 
ovulation of the oocyte. A key component to 
oocyte maturation is maturation promoting 
factor (MPF) kinases. In the pig, MPF activity is 
very low in the germinal vesicle stage, and 
peaks at metaphase I and II stages (Dedieu 
et a/., 1996). Other kinases, such as MOS and 
mitogen-activated protein (MAP), which are 
referred to as cytoplasmic factors (CFs), are 
also involved in the regulation of meiotic 
events. It has long been established that the 
cMos gene affects the block in the meiotic cell 
cycle at metaphase 2 in mammalian oocytes 
(Colledge et al., 1994). The Gs-coupled recep¬ 
tor, GPR3, has also been identified as an essen¬ 
tial regulator of meiotic arrest in the mouse 
oocyte (Mehlmann et al., 2004). Gpr3 RNA 
has been localized in oocytes, and oocytes 
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Fig. 12.1. Structures of a porcine antral (pre-ovulatory) ovarian follicle. From Patten (1948), with permission. 


from mice lacking the Gpr3 gene undergo 
spontaneous oocyte maturation within fully 
grown, intact follicles, independent of an 
increase in LH. Also, Andreu-Vieyra et a\. 
(2006) provide a comprehensive overview of 
mouse models that demonstrate the import¬ 
ance of oocyte-expressed genes in fertility, 
including the genes for: growth differentiation 
factor 9 (Gd/9); bone morphogenetic protein 
15 (Bmp 15); zona pellucida glycoproteins 
(Zpl, Zp2 , Zp3); helix-loop-helix transcription 
factor (Figla); connexins ( Gjal, Gja2 ); and the 
homeobox gene (Nobox). Although maturing 
oocytes are both transcription and translation 
active, the presence of a specific mRNA in the 
oocyte does not ensure that the protein is actu¬ 
ally expressed, particularly given that mamma¬ 
lian oocytes contain a large percentage of 
mRNA that is not polyadenylated and there¬ 
fore not translated. As a result, proteomic 
analysis is favoured over gene expression in 
identifying molecules that are differentially 
expressed during oocyte maturation (Vitale 

et al., 2007). 


Spermatogenesis is a cyclic and highly 
coordinated process that begins with sperma- 
togonial stem cells, which, in contrast to the 
process of oogenesis, continually proliferate 
and replenish the testicular seminiferous tubules 
by mitotic division to produce a virtually unlim¬ 
ited number of gametes. The spermatogonia 
proceed through two meiotic divisions, which 
are followed by spermiogenesis, in which hap¬ 
loid spermatids develop into spermatozoa 
(Merchant-Larios and Moreno-Mendoza, 
2001). All types of male germ cells are found 
in a single section of the seminiferous tubule 
epithelium. The cycle of the seminiferous epi¬ 
thelium refers to the complete progression 
through this series of cellular stages and is 
unique for each species, as is the duration of 
spermatogenesis. The entire progression of a 
seminiferous cycle in the pig is 8.3 days (Franca 
and Russell, 1998), while the complete pro¬ 
cess of spermatogenesis from spermatogonia 
A to a fully formed spermatozoa requires 
39 days in the boar (Franca and Russell, 1998). 
An important event related to the meiotic 
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division of spermatogenic cells occurs during 
the pre-leptotene phase, when complete DNA 
replication occurs, forming tetrads without 
separation. The resulting chiasmata facilitate 
chromosomal recombination, so that the 
prophase of the first meiotic division during 
spermatogenesis is a major source of genetic 
heterogeneity. Owing to the complexity and 
duration of the spermatogenic process, the 
underlying genetic mechanisms are not fully 
understood, particularly in domestic animals 
such as the pig; however a review by Yan 
(2009) reveals 20 different genes that, when 
deleted, adversely affect male fertility. 

During both oogenesis and spermatogen¬ 
esis, epigenetic mechanisms that regulate gene 
transcription are also active in the form of DNA 
methylation. The primordial germ cells which 
are the starting point of gametogenesis in the 
early embryo have highly methylated DNA 
(Reik et a/., 2001; Hajkova et a/., 2002). 
However, once these cells have populated the 
developing gonads they are generally hypo- 
methylated. In the pig, this demethylation 
process is complete by day 28 (D28) of devel¬ 
opment (Fetkov et a/., 2009). During game¬ 
togenesis there is a de novo methylation of the 
gametic genomes catalysed by DNA (cytosine-5)- 
methyltransferases (DNMTs) that regulate genes 
during development and play a role in genomic 
imprinting. Genomic methylation patterns are 
erased and reacquired differentially in the devel¬ 
oping male and female gametes, are further 
modified in the early embryo and become rela¬ 
tively stable by late embryogenesis. Sex-specific 
DNA methylation of particular domain 
sequences occurs, forming the basis for pater¬ 
nally and maternally imprinted genes (discussed 
later). It is proposed that the erasure and reset¬ 
ting of DNA methylation that take place as a 
part of gametogenesis are likely to be important 
in preventing DNA methylation defects from 
being passed from one generation to the next. 


Fertilization and embryonic development 
within the zona pellucida 

Fertilization occurs within a few hours of ovu¬ 
lation in an inseminated sow, and the initial 
cleavage to a two-cell embryo within 12-16 h 


post ovulation. For this review, day 0 (DO) of 
development is considered to be the day that 
ovulation and fertilization occur, and all other 
temporal reference to developmental stages 
use this as the starting point. The porcine con- 
ceptus then develops through a series of criti¬ 
cal stages prior to implantation. These steps 
and the timing of critical events in the prenatal 
development of the pig are summarized in 
Table 12.1. Following fertilization of an 
oocyte, the arrested meiotic process resumes, 
the second polar body is extruded into the 
perivitelline space, and then the male and 
female pronuclei form (Fig. 12.2a). The com¬ 
pletion of meiosis is facilitated by various sig¬ 
nal transduction pathways that converge to 
inactivate the MPF (Colledge etal., 1994). The 
MOS MAP kinase pathway is also considered 
to be responsible for the phosphorylation of 
spindlin (Oh et a/., 1997), which is associated 
with the meiotic spindle and the changing of 
the metaphase spindle into an anaphase con¬ 
figuration that requires the presence of cal¬ 
cium/calmodulin-dependent protein kinase II 
(CaM) and has been characterized in the por¬ 
cine oocyte. (Fan et a/., 2003; Xu et a/., 
2009). Oocytes of cMos knockout mice 
undergo spontaneous activation, owing to a 
lack of spindlin phosphorylation, which destab¬ 
ilizes metaphase II arrest (Hashimoto et a/., 
1994). Fertilization also triggers waves of 
increased calcium concentration passing 
through the cytoplasm, which is thought to be 
mediated by the soluble sperm protein oscillin 
(Parrington et a/., 1996). These calcium 
waves, also referred to as the calcium tran¬ 
sient, lead to a remodelling of the cytoplasm 
and nuclear compartments, suggesting that 
they play a role in the initiation of transcrip¬ 
tion. Study of the calcium transient in pigs has 
generally been related to the refining of in 
vitro fertilization procedures (Funahashi et a/., 
1995; Ito et a/., 2003), although it would 

appear that this is only one of the regulatory 
events that may proceed in waves through the 
cytoplasm at the time of fertilization. Additional 
factors are being discovered as promoting 
oocyte cytoplasmic and nuclear maturation, 
including the Sonic hedgehog signalling (Shh) 
pathway (Nguyen et a/., 2009). 

Gene expression studies in porcine gam¬ 
etes around the time of fertilization are limited, 
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Table 12.1. Essential events and timing of pig prenatal development. Compiled from: Patten (1948); 
Evans and Sack (1973); Hyttel and Niemann (1990); Schultz (2002); Blomberg et al. (2005); Vejlsted et al. 
(2006b); Oestrup et al. (2009). 

Day of development 3 Developmental stage/event 


Day 0 (14—16 h post ovulation) 3 

Day 1 

Day 2 

Days 2-3 

Day 3 

Day 5 

Day 6 

Days 7-8 

Days 9-10 

Days 11-12 


Day 12 
Days 13-14 
Days 14-15 


Day 16 

Days 17-18 

Day 19 

Days 20-21 

Day 22 
Days 24-26 
Day 28 

Day 30 
Days 34-35 
Day 44 
Days 46-49 
Days 112-116 


Cleavage to two-cell stage 
Cleavage to four-cell stage 

Development to four-to-eight cell stage; genome activation 
Transition from oviduct to uterus 
Blastomere compaction; morula development 
Blastocoel development 
Blastocyst expansion 
Blastocyst hatches from zona pellucida 
Conceptus is an expanded sphere; embryonic disc forms 
Pre-streak stage; maternal recognition of pregnancy; gastrulation 
begins; elongation of trophectoderm; primordial germ cell 
formation; beginning of implantation 
Primitive streak develops; gastrulation continues 
Open neural tube forms 

First somites apparent; neural tube developing and begins to 
close; optic vesicles apparent; attachment of conceptus to 
uterine endometrium; implantation is accomplished 
Neural tube closed; heart and auditory pits apparent; forelimb 
bud forming 

Amnion completely developed; optic vesicle present; hind limb 
bud forming 

Allantois fills and contacts the chorion; lens and optic cup well 
developed 

Olfactory pits present; eye pigments form; intestines herniated 
into umbilical cord 
End of somite formation 
Embryonic gonads visible 

Hair follicles appear; eyelids forming; external genitalia 
differentiate; digits forming 
Chorion fully vascularized by allantoic vessels 
Palate fused; facial clefts closed 
Prepuce, scrotum, labia and clitoris present 
Eyelids cover eyes 
Birth 


a Day 0 (DO) of development is considered to be the day that ovulation and fertilization occur, and all other temporal 
references to developmental stages use this as the starting point. 


but the presence of transcripts of the genes for 
clusterin ( CLU ), protamine 2 (PRM2), calmegin 
(CLGN), cAMP-response element modulator 
protein ( CREM ), methyltransferase 1 (DNMT1), 
linker histone 1 (HI), protamine 1 (PRM1), 
TATA box-binding protein associated factor 1 
(TAF1) and TATA box-binding protein (TBP) in 
porcine spermatozoa, mature oocytes, zygotes 
and two-cell stage embryos has been assessed 
(Kempisty et al., 2008). Transcripts for all of 
these genes were detected in spermatozoa, 
while oocytes contained only CREM, HI, 


TAF1 and TBP mRNAs. The zygote and two¬ 
cell stage embryos contained transcripts for the 
CLU, CREM, HI, PRM1, PRM2, TAF1 and 
TBP genes. This suggests that porcine sperm¬ 
atozoa may deliver CLU, PRM1 and PRM2 
mRNAs to the oocyte, and that these probably 
contribute to zygotic and early embryonic devel¬ 
opment. By 48 h of development (D2) the por¬ 
cine embryo is transformed from the two- to 
the four-cell stage (Fig. 12.2b and c) and, at the 
end of the third cell cycle, the transition from 
maternal to embryonic genome expression will 
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Fig. 12.2. Early pre-implantation pig embryos: (a) fertilized zygote with two polar bodies and sperm on zona 
pellucida; (b) two-cell stage embryo; (c) four-cell stage embryo; (d) eight-cell stage embryo; (e) blastocyst in 
zona pellucida; (f) hatched blastocyst. 
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commence (Brevini et a/., 2007). Subsequently, 
the embryo continues to cleave and initiates 
compaction as early as the 8- to 16-cell 
stage (Fig. 12.2d), and, by D2-3 of embryo 
development, this leads to formation of a com¬ 
pacted morula and transfer from the oviductal 
environment to that of the upper uterine horn. 
During the compaction process, blastomeres 
begin to flatten and cell-to-cell contacts become 
more pronounced; internal and external cells 
steadily differentiate and obtain some degree of 
polarity (Rossant and Tam, 2009). In the uter¬ 
ine environment, formation of the blastocyst, 
or blastulation, occurs. The blastocoel or inter¬ 
nal cavity develops next and, as a result, forma¬ 
tion of the blastocyst occurs during D5 of 
development (Fig. 12.2e). Within the following 
24 h, the blastocyst attains its typical structure 
and two distinct cell lineages: the inner cell 
mass (ICM) and trophoectoderm (TE). As a part 
of this process, the outer layers, closest to the 
zona pellucida, become connected by tight 
junctions and desmosomes to seal the expand¬ 
ing blastocyst cavity in which the ICM forms as 
a tight cluster of cells. The ICM cells communi¬ 
cate via gap junctions, facilitating the filling of 
the blastocoel and, in turn, expansion of the 
embryo (D6), until it ultimately hatches from 
the zona pellucida by D7-8 (Fig. 12.2f). 


Post-hatching development 

At the time of hatching, the TE makes up the 
majority of the external cells of the blastocyst; it 
will develop an epithelial phenotype, form much 
of the extra-embryonic tissue, and play a critical 
role during implantation and formation of the 
trophoblast layers of the placenta. The ICM 
makes up the remaining cells, and differentiates 
into the epiblast and the hypoblast (primitive 
endoderm). At D9 of development, there is a 
disintegration of the TE polar region covering 
the epiblast to create the structure known as the 
embryonic disc. By DIO the conceptus is 
enlarged (8-10 mm), but still spherical, and the 
embryonic disc is fully formed (Fig. 12.3). The 
embryo may be referred to as being at the pre¬ 
streak 1 stage at this point (Vejlsted et a/., 
2006a). During Dll-12, the embryonic disc 
transforms from a circle into an oval structure, 


and at one pole of the disc a prominent crescent¬ 
shaped thickening appears. This represents the 
pre-streak 2 stage and the first signs of anterior- 
posterior polarity of the embryo. At this point, a 
dramatic elongation of the conceptus occurs, in 
which it converts from a spherical to a long, fila¬ 
mentous structure, often over lm in length. 
The conceptus will also signal its presence dur¬ 
ing this period to allow for maternal recognition 
of pregnancy to occur in order that the corpora 
lutea (CL) are maintained and the uterine envi¬ 
ronment remains such that it will support and 
promote pregnancy. In the pig, blastocysts 
begin to produce oestrogens by Dll of devel¬ 
opment, which through a series of processes 
(discussed below) prevent secretion of the uter¬ 
ine luteolytic factor (PGF2 alpha) in an endo¬ 
crine direction, while allowing secretion in an 
exocrine direction (i.e. into the uterine lumen), 
thereby protecting the CL from luteolysis or 
regression (Spencer et a/., 2004). Then, by 
approximately D12-13 of development, the 
primitive streak appears at the posterior end of 
the embryonic disc (Fig. 12.4), corresponding 
to the onset of gastrulation. 

Gastrulation tends to initiate in parallel 
with elongation of the TE (Dll-12), and 
involves a complex sequence of cellular differ¬ 
entiation and movement that ultimately facili¬ 
tates the generation of uniquely distinct 
structures and tissues within the conceptus. 
The major result is the formation of the three 
primary germ layers: endoderm, mesoderm 
and ectoderm. The resulting ectoderm will 
eventually give rise to the nervous system and 
epidermis, the mesoderm develops into the 
cardiovascular, urogenital and muscular sys¬ 
tems, while the endoderm is the starting point 
for the digestive, pulmonary and endocrine 
systems. In addition to these somatic germ lay¬ 
ers, the primordial germ cells are also formed 
(Flechon et a/., 2004a). The initiation of gas¬ 
trulation as a developmental phase precedes 
neurulation, but its completion overlaps with 
this later process. The first sign of neurulation 
is a thickening of the anterior ectoderm as the 
primitive streak regresses and the formation of 
neural plate folds to become the neural groove 
(van Straaten et a/., 2000). In the pig, the 
neural groove develops at approximately 
D14-15, which also coincides with the initia¬ 
tion of segmentation (Fig. 12.5). 
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ICM Visceral 
endoderm 



Visceral endoderm Early mesodermal cells 


Fig. 12.3. (a) Porcine conceptus at DIO of development with embryonic disc clearly visible; (b) 

Schematic showing the loss polar TE (trophoectoderm) and emergence of the embryonic disc (ICM, inner 
cell mass); (c) Schematic showing stage of first mesenchyme cell migration. Figures (b) and (c) 
reproduced from Flechon et al. (2004b), with the kind permission of the authors. 


Segmentation is the developmental proc¬ 
ess that subdivides the body into a series of 
subunits that result in the different segments 
of the body. In vertebrates, the earliest form 
of segmentation is the development of 
somites, which result from a thickening of the 
mesoderm in the midline of the embryo to 
form blocks of mesodermal cells. The first 
somites in the pig are apparent on D14 of 
development (Vejlsted et a/., 2006b). In paral¬ 
lel to somite formation, the neural tube pro¬ 
gressively forms and begins to close by 
the five-to-seven somite stage (D15) and is 


complete by the 28-somite stage (D16-17) 
(van Straaten et a/., 2000). Soon after gastru- 
lation in porcine embryos, the endoderm 
germ layer forms a primitive gut tube, which 
subsequently leads to organ specification 
(foregut, midgut and hindgut), then formation 
of organ buds, and finally to more specialized 
cell lineages (D20-22). 

In the pig, fertilization of ovulated 
oocytes generally exceeds 95%, but 30-50% 
of these fertilized oocytes generally do not 
survive to parturition 112-116 days later 
(Geisert et al ., 1982). It has been estimated 
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Fig. 12.4. Drawing of pig conceptus at D12-13 of development; primitive steak appears at the posterior 
end of the embryonic disc. From Patten (1948), with permission. 


that 20-30% of porcine embryonic mortality 
occurs between D12 and D13, and another 
10-15% occurs by mid-gestation (Pope 
et al., 1982; Stroband and Van der Lende, 
1990). Another critical period is D20-30, as 
insufficient placental development during 
this stage influences subsequent fetal growth 
and survival in the uterine environment 
(Freking et al ., 2007). Tayade et al. (2007) 
propose there to be three critical events for 
successful porcine development and survival 


in utero : (i) activation of the embryonic 
genome occurs at the four-to-eight cell stage 
(D2); (ii) blastocyst expansion and develop¬ 
ment of embryonic polarity and morphologi¬ 
cal differentiation; and (iii) attachment of the 
elongating conceptus to the endometrium by 
D15 and subsequent establishment of the 
placenta to sustain the developing fetus 
(Geisert et al 1982). The remainder of the 
chapter will deal with the genetic control of 
these critical steps. 





272 


M. Dyck and A. Ruvinsky 


Yolk sac • 


Cephalic • 
amnionic fold 


Lateral 
amnionic fold 


Primitive 

streak 



Yolk sac 


Extra-embryonic 

somatopleure 


Neural plate 


Neural groove 


Somite 


Hensen’s node 


Caudal 
amnionic fold 


Fig. 12.5. Drawing of pig conceptus at D14-15 of development; neural groove developing and somite 
formation occurring. From Patten (1948), with permission. 


Genetic Control of Pre-implantation 

Development 

Expression of maternal genes 

As previously mentioned, unovulated porcine 
oocytes are arrested at the prophase of the 
first meiotic division, but both transcription 
and translation are very active and under 
‘maternal command’. Mouse oocytes express 
about 5400 genes and transposable elements, 
many of which are conserved in chordates 
(Evsikov et a/., 2006). Numerous newly syn¬ 
thesized mRNAs are stored and used later dur¬ 
ing oocyte maturation and up until embryonic 
genome activation (EGA), which occurs at the 


two-cell stage in the mouse (Hamatani et a/., 
2006) and the four-cell stage during pig devel¬ 
opment (Oestrup et a/., 2009). Meanwhile, 
depletion of maternal mRNA intensifies prior 
to fertilization and continues until activation of 
the embryonic genome. By this time, nearly 
90% of maternal mRNA is degraded and the 
majority of such transcripts are exclusively 
expressed from the oocyte genome 
(Bettegowda et a/., 2008). In the mouse, and 
probably also other mammals, including the 
pig, ‘housekeeping’ genes are under-repre¬ 
sented in the oocyte and early embryo transcrip- 
tomes. It has been suggested that this unique 
feature indicates that the core function of the 
oocyte is to act as a ‘reprogramming machine’ 
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to create a totipotent embryo (Evsikov and 
Marin de Evsikova, 2009). 

While current understanding of the transi¬ 
tion towards mature oocyte and embryonic 
development is only emerging, and some 
species-specific deviations are possible, it is 
useful to develop a more comprehensive view. 
Several genes, some identified recently, guide 
this process (Fig. 12.6). Among such genes is 
Eif41b, which is involved in translational 
repression of maternal mRNAs. In the mouse, 
an oocyte-specific mammalian form of eukary¬ 
otic translation initiation factor 4E coded by the 
Eif41b gene may influence the speed of oocyte 
maturation (Evsikov et a/., 2006). Another 
example is the inhibitory phosphorylation of 
the CDC2 protein with kinase activity, which is 
catalysed by pig Wee IB protein. This involves 
meiotic arrest of porcine oocytes. The inactiva¬ 
tion of WeelB gene, in combination with other 
factors, leads to the resumption of meiosis 
(Shimaoka et a/., 2009). In mature oocytes, the 
degradation of maternal transcripts becomes 
more prominent, and seems to be nearly com¬ 
pleted by the two-cell stage, when the so-called 
minor zygotic genome activation takes place. In 
fact, the ZAR1 gene (zygote arrest 1) is one of 
the few known oocyte-specific maternal-effect 
genes essential for the beginning of embryo 
development (Wu et a/., 2003). Surprisingly, 
some Zarl(-/~) mice are viable and look nor¬ 
mal. However, Zar !(-/-) females are infertile, 
probably as a result of the arrest of embryonic 


development in the majority of zygotes at the 
one-cell stage, and the fact that maternal and 
paternal genomes remain separate in such 
zygotes. These Zarl(-/-) embryos show a 
marked reduction in the synthesis of the tran¬ 
scription-requiring complex, with fewer than 
20% of them progressing to the two-cell stage, and 
none develop to the four-cell stage (Wu et a/., 
2003). This gene is evolutionary conserved, 
and the protein plays a role in transcription 
regulation during oocyte maturation and early 
post-fertilization development (Uzbekova et a/., 
2006). Several more maternal-effect genes, 
which were identified recently, are depicted in 
Fig. 12.6 and described in detail by Bettegowda 
et al. (2008). 

Contrary to what is known for other mam¬ 
mals, in porcine zygotes there is an absence of 
the DNA replication checkpoint, meaning that 
the onset of DNA replication occurs very early 
in the maternal pronucleus, before activation 
of the paternal pronucleus (Vackova et a/., 
2006). The first divisions of the mammalian 
embryo are largely controlled by proteins and 
transcripts stored during oogenesis and oocyte 
maturation; porcine embryonic development is 
no different in that sense. Zygote genome acti¬ 
vation (ZGA) in mice follows two stages: a 
minor one before cleavage and a major one at 
the two-cell stage and later (Hamatani et a/., 
2006). In the pig and other mammals, ZGA 
occurs slightly later, after the four-cell stage. 
Then the nucleoli, which are essential for 
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Fig. 12.6. Current knowledge on genetic regulation of the oocyte-to-embryo transition in mammals 
relevant to porcine development. Redrawn from Bettegowda et al. (2008), with modifications compiled 
from several sources, including Magnani and Cabot (2008, 2009); Shimaoka et al. (2009); Evsikov and 
Marin de Evsikova (2009). Question marks indicate those genes for which activities have not, as yet, 
been confirmed in porcine embryos. EGA, embryonic genome activation. 





























274 


M. Dyck and A. Ruvinsky 


ribosomal RNA (rRNA) and ribosome produc¬ 
tion, develop in order to support protein syn¬ 
thesis. After fertilization, structures resembling 
the nucleolar remnant are established in the 
pronuclei; they are engaged in the re-establish¬ 
ment of fibrillo-granular nucleoli during the 
major activation of the embryonic genome 
(Maddox-Hyttel et a/., 2007). 

It is well known that in Drosophila mela- 
nogaster and Caenorhabditis elegans gradi¬ 
ents of morphogens in the zygote and early 
embryo are crucial for establishing positional 
information (St Johnston and Nusslein-Volhard, 
1992; Nusslein-Volhard, 1996). These gradi¬ 
ents are essentially products of maternal gene 
expression. To what degree similar gradients 
and elements of the cytoskeleton are important 
during the earliest stage of mammalian devel¬ 
opment still is not clear. Increasing cell polarity 
was described at the eight-cell stage of mouse 
and rat development (Reeve, 1981; Gueth- 
Hallonet and Maro, 1992). Cell fate, control¬ 
led by positional information, seems reversible, 
and provides the developing embryo with a 
certain degree of flexibility. In cattle, cellular 
polarization occurred in some blastomeres at 
the 9- to 15-cell stage, but typical distinct polar¬ 
ity was not manifested until after the 16-cell 
stage, with approximately 40% polar cells per 
embryo (Koyama et a/., 1994). Chimeric 
murine embryos, constructed from two-cell 
stage blastomeres from which the animal or 
the vegetal poles have been removed, can 
develop into normal fertile adult mice. 
Although polarity of the post-implantation 
embryo can be traced back to the eight-cell 
stage and, in turn, to the organization of the 
oocyte, its role is not entirely clear (Ciemerych 
et a/., 2000). It seems that in mammals axis 
specification during oogenesis and through to 
the early stages of cleavage is under strong 
regulation. This is unlike what is observed in 
other metazoans, and may be related to viviparity 


(Evsikov and Marin de Evsikova, 2009). If so, 
then the gradients which are so important in 
insects and worms may not be crucial for the 
very early stages of mammalian development. 
The establishment of axial polarity during 
cleavage and blastocyst formation is consid¬ 
ered later (Johnson, 2009). 


Activation of the embryonic genome 

The first wave of EGA, the so called minor acti¬ 
vation, occurs primarily in a male pronucleus 
up until the two-cell stage and results in the 
synthesis of only a few specific polypeptides. 
The second wave, starting from the two-cell 
stage, leads to massive changes in the gene 
expression pattern. Data relevant to murine 
development show that the most significant 
activation of the embryonic genome takes 
place at the two-cell and particularly at the 
four-cell stages (Table 12.2). Many hundreds of 
other genes remain continually active during 
this period as well. In porcine embryos the 
transition to more intensive transcription is 
shifted towards the four-cell stage (Whitworth 
et a/., 2005). The transcriptional dynamics of 
some porcine embryonic genes at very early 
stages provide useful information. For instance, 
the ZP3 and ZP4 genes coding for the major 
components of the mucoprotein layer of the 
zona pellucida have very high levels of expres¬ 
sion in the germinal vesicle oocyte and pro¬ 
gressively decline at the four-cell and the 
blastocyst stages. A similar pattern, slightly 
shifted towards the blastocyst stage, was 
observed for the DNMT genes responsible for 
the DNA methyltransferase involved in the 
regulation of transcription and genomic imprint¬ 
ing (Ko et a/., 2005; Jeong et a/., 2009). Another 
porcine gene coding for prothymosin alpha, 
which is involved in chromatin remodelling, 


Table 12.2. Dynamics of stage-specific gene activation during early mouse development. Data are 
extracted from Hamatani etal. (2006), Fig. 2. 
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among other functions, peaks in activity at the 
four-cell stage and then steadily declines. 
Expression of the dihydrolipoamide dehydro¬ 
genase gene increases from the oocyte to four¬ 
cell and blastocyst stages, thereby reducing 
lipid and protein peroxidation. Published data 
also point out that the SMARCA2 and 
SMARCA4 genes, which are active at the early 
stages of development, play essential roles in 
controlling the expression of other genes dur¬ 
ing early mammalian embryogenesis (Magnani 
and Cabot, 2009). The expression of the actin 
coding gene (ACTB) in porcine embryos 
increases over 20-fold starting from the two-cell 
stage (Whitworth et a/., 2005). 

Significant activation of transcription is an 
essential prerequisite for the intensification of 
translation that follows. RPL23 , one among 80 
genes controlling ribosomal proteins, activates 
in the porcine embryo at the blastocyst stage 
before a major increase in translation (Whitworth 
et a/., 2005). Bjerregaard et al. (2004) demon¬ 
strated in the pre-implantation porcine embryo 
the nucleolus-related gene expression of several 
proteins involved in rRNA transcription 
(upstream binding factor, UBF1; topoisomer- 
ase I, TOPI ; RNA polymerase I, POLR1 ; and 
the RNA Pol I-associated factor PAF53, 
POLR1E) and processing (fibrillarin, FBL; 
nucleophosmin, NPM1; and nucleolin, NCL). 
The first significant activation of the activity of 
these proteins was observed at the four-cell 
stage, and it then increased significantly at the 
blastocyst stage. Another marker that has been 
used to characterize genome activation is elon¬ 
gation initiation factor 1A mRNA ( elFlA ). 
Magnani et al. (2008) observed activation of 
elFlA at the two-cell stage in IVF (in vitro- 
fertilized) embryos. As previously stated, activa¬ 
tion of the embryonic genome occurs at the 
four-to-eight cell stage (D2), while the dominant 
role of embryonic genes is established only after 
gastrulation (Dll-12) (de Vries et al., 2008). 


Reprogramming, methylation pattern 
and genomic imprinting 

During the first 24 h or so after fertilization, the 
mammalian oocyte and sperm undergo natural 
reprogramming that gives rise to a totipotent 


zygote (de Vries et al., 2008). Genomic repro¬ 
gramming is a complex process involving 
numerous mechanisms. Protein and mRNA 
molecules accumulated in the oocyte facilitate 
reprogramming through chromosome remod¬ 
elling, as well as differential utilization and 
degradation of mRNA. As previously men¬ 
tioned, DNA methylation is erased from 
chromatin very early during development, thus 
creating a critically important condition for the 
next cycle of life. In porcine embryos, demeth- 
ylation continues up to the eight-cell stage, 
which is followed by de novo methylation. 
Incomplete epigenetic reprogramming is 
common for nuclear-transfer generated 
embryos and contributes to the low efficiency 
of the cloning procedure (Dean et al., 2001). 

The following developmental stages lead 
to the occurrence of pluripotent cell types with 
narrowed potential. Gene-expression pro¬ 
grammes operating in these pluripotent cells 
steadily become more defined, the production 
of core transcription factors begins, and the 
expression of pluripotency-associated genes 
commences. At least three genes (OCT4, 
NANOG and SOX2) coding for transcription 
factors have been identified. They are respon¬ 
sible for activation of other genes essential for 
the maintenance of pluripotency and the tem¬ 
porary repression of genes required for further 
differentiation (de Vries et al., 2008). 

Genes that are required later in develop¬ 
ment are repressed by histone marks, which 
confer short-term, and therefore flexible, 
epigenetic silencing (Reik, 2007). As soon as 
demethylation is accomplished, a new wave of 
DNA methylation begins, and this leads to 
stable and long-term epigenetic silencing of 
certain genetic elements, such as transposons, 
imprinted genes and pluripotency-associated 
genes. Whether such DNA methylation epige¬ 
netic marks play a key role in determining cell 
and lineage commitment still remains an open 
question (Reik, 2007). 

Porcine development shows all the typical 
features discovered in other mammalian spe¬ 
cies, including reprogramming and epigenetic 
modification. For instance, methylation of the 
lysine residue 27 of histone H3 (H3K27), which 
is an essential epigenetic marker linked to 
transcriptional repression, was confirmed dur¬ 
ing cleavage in porcine embryos. Three genes 
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were described ( EED , EZH2 and SUZ12) that 
regulate the H3K27 methylation process start¬ 
ing from the four-cell stage. The data indicate 
that H3K27 trimethylation is an epigenetic 
marker of maternally derived chromatin which 
undergoes global remodelling (Park et ai, 
2009). The paternal genome in the pig is 
actively demethylated within several hours of 
fertilization, while the maternal genome, on 
the contrary, is demethylated passively by a 
replication-dependent mechanism after the 
two-cell embryo stage (Haaf, 2006). This type 
of demethylation is related to DNA repair pro¬ 
cesses (Reik, 2007). 

Gametic or genomic imprinting is a devel¬ 
opmental phenomenon typical for eutherian 
mammals and is based on the differential 
expression of maternal and paternal alleles in 
certain genes. These genes are essential for 
the regulation of embryonic and placental 
growth. In genes such as IGF2 , only the 
paternal allele is expressed (maternal imprint¬ 
ing); in contrast, in genes such as HI 9, only 
the maternal allele is expressed (paternal 
imprinting). Imprint acquisition occurs before 
fertilization and imprint propagation extends 
up until the morula-blastocyst stage (Shemer 
et a/., 1996). In HI9, the 2-kb region is 
methylated on the paternal allele during sper¬ 
matogenesis; the maternal allele has a differ¬ 
ent methylation pattern (Davis et a/., 1999). 
In pigs, HI 9 is exclusively expressed from the 
maternal allele in all major organs, similarly to 
what has been observed in other species. On 
the contrary, the majority of IGF2 transcripts 
are expressed paternally from promoters 2-4 
(Li et a/., 2008). 

The molecular mechanisms of gametic 
imprinting are still under investigation. It seems 
possible that primary gametic signals are not 
simply copied from the gametes, but rather 
that the methylation pattern typical of imprinted 
genes establishes gradually during early devel¬ 
opment (Shemer et a/., 1996). The regulatory 
elements that control genomic imprinting have 
differential epigenetic marking in oogenesis 
and spermatogenesis, which results in the 
parental allele-specific expression of imprinted 
genes during development and after birth (Feil, 
2009). Both DNA and histone methylation are 
essential for imprinting. The latest data also 
show that DNA methylation is involved in the 


acquisition and/or maintenance of histone 
methylation at imprinting control regions 

(Henckel et ai, 2009). 

The developmental function of gametic 
imprinting is also not absolutely clear, but an 
explanation proposed by Moore and Haig 
(1991) is widely accepted. This is based on the 
concept of genetic conflict arising during preg¬ 
nancy between maternally and paternally 
inherited genes. Thus, it is likely that gametic 
imprinting evolved in mammals to regulate 
intrauterine growth and to increase the safety 
of embryonic development. Lack of maternally 
or paternally derived alleles, or the abnormal 
expression of such alleles in a zygote, may lead 
to embryonic mortality and impose strict 
requirements on the stability of imprinting sig¬ 
nals. According to the database of imprinted 
genes (Imprinted Genes Catalogue, 2010), at 
the time of writing the number of such genes 
so far identified in the pig is 61. Imprinting was 
recently confirmed for DIRAS3, DLK1, H19, 
IGF2AS, NNAT , MFST , PFG10 , PHLDA2 , 
PFAGL1 , SGCF and SNORD107 pig genes 
(Bischoff et a/., 2009). The majority of 
imprinted genes in the pig (48) that have accu¬ 
mulated in the database were found in different 
quantitative trait loci (QTLs). While identifica¬ 
tion of these genes is still an objective for future 
studies, it is possible that information about 
QTL-related imprinting effects will sooner or 
later be adopted by selection programmes (de 
Vries et a/., 1994). 


Gene expression during blastocyst 
formation, expansion and elongation 

At the morula stage of development, activation 
of a few selected genes is critical for synthesis 
of morphogenetically important proteins such 
as actin and the actin-associated proteins like 
alpha-fodrin, vinculin, and E-cadherin (Reima 
et a/., 1993). These molecules are distributed 
evenly in blastomeres during early cleavage, 
but then gradually accumulated towards the 
blastocyst stage in the regions of intercellular 
contact (Reima et a/., 1993). This seems to be 
essential for development of tight gap junctions 
related to blastocoel formation, and is particu¬ 
larly relevant to the outer cell layer. 
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As outlined, formation of the blastocyst, 
including the blastocoel, ICM and TE within 
the zona pellucida, occurs on D5-6 of develop¬ 
ment. Despite the simplicity of the blastocyst 
structure, the mechanisms of its formation are 
still elusive. Three models - mosaic, positional 
and polarization - have been suggested and 
extensively studied (Johnson, 2009). As previ¬ 
ously discussed, the ICM differentiates into the 
epiblast and primitive endoderm. The epiblast 
gives rise to the embryo itself and also to some 
extra-embryonic tissues. The TE is responsible 
for the development of the remaining extra- 
embryonic tissues and plays a critical role dur¬ 
ing implantation and formation of the 
trophoblast layers of the placenta. It has been 
found that Cdx2, which encodes a caudal- 
related homeodomein protein, is a key regula¬ 
tor of the TE lineage (Rossant and Tam, 2009), 
even though expression of this gene begins 
earlier. Mouse gene Tead4 , which produces 
transcription enhancer, is tentatively consid¬ 
ered as an upstream factor relevant to Cdx2 ; 
Eomes , on the contrary, is a downstream- 
located factor (Rossant and Tam, 2009). The 
Cdx2 and Eomes proteins are restricted to the 
outer layer cells. These two genes, which are 
required for specifying the pluripotent cells of 
the ICM-like Oct4 (Nichols et al., 1998), Sox2, 
(Nichols et a/., 1998; Avilion et a/., 2003) and 
Nanog (Chambers et a/., 2003; Mitsui et a/., 
2003), are initially expressed in all blastomeres 
but progressively become restricted to the ICM 
cells after blastocyst formation. Adjaye et al. 
(2005) identified in developing human blasto¬ 
cysts marker transcripts specific to the ICM 

(e.g. OCT4/POU5F1, NANOG, HMGB1 and 
DPPA5) and TE (e.g. CDX2, ATP1B3, SFN 
and IPL). The emergence of pluripotent ICM 
and TE cell lineages from the morula is control¬ 
led by the metabolic and signalling pathways, 
which include, inter alia, WNT, mitogen- 
activated protein kinase, transforming growth 
factor-beta, NOTCH, integrin-mediated cell 
adhesion, phosphatidylinositol 3-kinase and 
apoptosis. 

Polarity of the cells in the blastocyst 
increases as a result of an accumulation of pro¬ 
tein kinase3, polarity protein Par3 and ezrin in 
the apical domain of blastomeres and apical 
membrane; other proteins, such as Lgl and 
Pari, are exclusively found in the basal portion 


of blastomeres (Rossant and Tam, 2009). 
Connexin proteins are differently expressed 
both temporally and spatially in the pig embryo, 
influencing the formation of gap junctions in the 
trophoblast and later controlling the exponential 
growth of the trophoblast in pre-implantation 
pig blastocysts (Flechon et al., 2004a). 

Although the rate of embryonic develop¬ 
ment differs in pigs and mice, there is a corre¬ 
lation between the developmental stage and 
cytoskeletal organization in both species. 
Likewise, in the expanded bovine blastocyst, 
the distribution of several cytoskeletal and 
cytoskeleton-related proteins appeared similar 
(Shehu et al., 1996). Extracellular fibronectin 
was first detected in the early blastocyst before 
differentiation of the primitive endoderm and, 
at this stage, was localized at the interface 
between the TE and the extra-embryonic endo¬ 
derm (Shehu et al., 1996). Cingulin, the tight 
junction peripheral membrane protein, also 
contributes to morphological differentiation in 
early mouse development, and it is likely that 
other mammals have the same gene. The syn¬ 
thesis of cingulin is tissue-specific in blastocysts, 
and is up-regulated in the TE and down- 
regulated in the ICM (Javed et al., 1993). 

It is commonly accepted that proto¬ 
oncogenes are involved in numerous processes 
of embryonic development and that they 
encode a series of nuclear transcription factors, 
intracellular signal transducers, growth factors 
and growth factor receptors. For example, acti¬ 
vation of the c-fos and c-jun proto-oncogenes 
in sheep conceptuses occurs during the period 
of rapid growth and elongation (Wu, 1996), 
and a similar pattern possibly occurs in porcine 
embryos. These proto-oncogenes are involved 
in the regulation of gene expression, cell prolif¬ 
eration and differentiation. 

Following hatching at D7-8, the blasto¬ 
cysts transform from a spherical to a tubular 
shape, the elongation process begins (Fatten, 
1948), ICM develops into the embryonic disc, 
and differentiation of the epiblast and the 
hypoblast starts. By Dll-12 of porcine 
development, when elongation and transfor¬ 
mation to a thin filamentous structure is accom¬ 
plished, the pattern of gene expression 
becomes rather complex (Ross et al., 2003; 
Blomberg et al., 2008). Ross et al. (2003) 
identified 142 genes that were differentially 
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expressed among spherical, tubular and 
filamentous conceptuses. It was also found that 
during the transition from tubular to early fila¬ 
mentous conceptuses, sadenosylhomocysteine 
hydrolase and heat shock cognate 7 0 kDa 
expression were significantly enhanced. 
Conceptus expression of OCT4 on Dll-12 
was approximately 7.7-fold and 11.6-fold 
greater than expression on D15 and D17, 
respectively. This result suggests that down- 
regulation of OCT4 may be important during 
conceptus expansion, following implantation 
and gastrulation in the pig (Spencer et a/., 
2006). Comparisons between ovoid and fila¬ 
mentous conceptuses showed elevated expres¬ 
sion of genes involved in steroidogenesis, such 
as cytochrome P-450scc (CYP11A1), aro- 
matase ( CYP19A ) and steroidogenic acute 
regulatory protein (STAR), and also oxidative 
stress response pathways (MGST1) and 
copper-zinc superoxide dismutase ( SOD1) 
(Blomberg and Zuelke, 2005). The involve¬ 
ment of oestrogen and interleukin-1-beta was 
also confirmed (Blomberg et a/., 2008). 
Comparisons between the spherical and fila¬ 
mentous conceptuses (D12), as well as filamen¬ 
tous D12 versus filamentous D14, showed that 
expressions of 482 and 232 genes, respect¬ 
ively, were statistically different. The most 
significant differences were observed in genes 
coding for interferon-y (IFNG), heat shock 
protein 27 kDa (HSFB1), angiomotin, B-cell 
linker (BLNK), chemokine ligand 14 (CXCL14), 
parathyroid hormone-like hormone (FTHLH) 
and maspin (Ross et ai, 2009). 

A comparison of bovine transcriptomes 
from the blastocyst (D7) and conceptus (D14) 
stages revealed that -500 genes were up- 
regulated between these developmental stages, 
and only 26 genes were down-regulated 
(Ushizawa et a/., 2004). 


Genetic control of gastrulation 

On Dll of development, at the stage preced¬ 
ing gastrulation (pre-streak), there is a height¬ 
ened rate of cell proliferation in the posterior 
section of the epiblast in the porcine conceptus. 
Migration of the cells that are proposed to be 
the precursors of the primitive streak and 


expression of the Brachyury (T) gene begin 
(Flechon et ai, 2004b). This gene belongs to 
the T-box family of genes, which contribute 
greatly to tissue specification, morphogenesis 
and organogenesis (Muller and Flerrmann, 
1997). Brachyury interacts with the Goosecoid 
gene (GSC), which codes for a homeobox pro¬ 
tein, whose very low expression in porcine 
embryos can be detected as early as D7-8 of 
development. Flowever, the intensity of expres¬ 
sion becomes more pronounced by D12, par¬ 
ticularly in differentiating mesodermal cells that 
ingress from the epiblast via Hensen’s node 
(van de Pavert etal., 2001). This process finally 
leads to formation of the mesoderm and embry¬ 
onic endoderm. Goosecoid overexpression 
may repress the Brahyury gene and affect nor¬ 
mal development (Boucher et a/., 2000). In 
porcine embryos at the expanding hatched 
blastocyst stage, OCT4 is confined to the inner 
cell mass. Following separation of the hypo¬ 
blast and formation of the embryonic disc, this 
marker of pluripotency was selectively observed 
in the epiblast. Progressive differentiation of 
germ layers and tissues leads to silencing of this 
gene, with exception of its expression in the 
primordial germ cells (Vejlsted et ai, 2006a). 

The migratory cells converge at the mid¬ 
line of the posterior part of the epiblast, which 
creates a thickened longitudinal band known 
as the primitive streak (Patten, 1948; van de 
Pavert et ai, 2001). At the pre-streak stage, 
which precedes gastrulation and migration of 
the extra-embryonic mesoderm, the embry¬ 
onic disc becomes polarized (Flechon et ai, 
2004b). The early primitive streak is charac¬ 
terized by both high pseudostratified epithe¬ 
lium with an almost continuous but unusually 
thick basement membrane, and Brachyury 
expression. Brachyury is crucial for notochord 
development in all examined chordates, and at 
least 44 notochord-expressed genes are its 
transcriptional targets (Capellini et ai, 2008). 
Expression of the NODAL gene is essential 
for axial patterning during early mammalian 
gastrulation, as well as for induction of the 
dorso-anterior and ventral mesoderm (Jones 
et ai, 1995). Interestingly, the round shape 
and gradual posterior displacement of the por¬ 
cine anterior pre-gastrulation differentiation 
appears to be species specific, and correlates 
with development of the primitive streak and 
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extra-embryonic mesoderm (Hassoun et a/., 
2009). As gastrulation proceeds, the primitive 
streak extends anteriorly, and at its distal end 
the Hensen’s node is developed; this is com¬ 
posed of a mass of epithelium-like cells without 
cilia (Blum et a/., 2007) (Fig. 12.4). Expression 
of the Goosecoid gene is typical for these cells, 
which are the origin of the notochord. 

The notochord is a flexible fibro-cellular 
cord lying ventral to the developing central 
nervous system, and represents the major axial 
structure of the embryo, which plays an 
important role in the induction of the neural 
plate, chondrogenesis and somite formation. 
Glycoproteins compose a core of the noto¬ 
chord, with cells encased in a sheath of collagen 
fibres. Two genes controlling notochord forma¬ 
tion encode laminin bl and laminin gl, and are 
essential for building the scaffold on which 
individual cells organize the rod-like structure 
typical of the notochord (Parsons et al., 2002). 
Higher protein production of integrin subunits 
that regulate interactions with collagens and 
laminin is known for notochordal cells (Chen 
et a/., 2006). In vertebrates, the notochord is 
replaced during development by the vertebrate 
column. The notochord grows anteriorly from 
the Hensen’s node below the embryonic disc, 
and is composed of cells derived from a certain 
kind of differentiating mesodermal cells that 
ingress from the epiblast. Three key porcine 
genes, SOX17, NODAL and Brachyury (T), 
are involved in early development of the axial 
structure during gastrulation (Hassoun et a I., 
2009). According to Zorn and Wells (2009) the 
Nodal signalling pathway is necessary and 
sufficient for initiation of endoderm and meso¬ 
derm development, and it is required for proper 
gastrulation and axial patterning. Nodal ligands 
are members of the TGFp family of secreted 
growth factors. NOTO is another gene that is 
required for the formation of the caudal part of 
the notochord, as well as for ciliogenesis in the 
posterior notochord. The data also show that 
No to acts during murine development as a 
transcription factor upstream of Foxjl and 
Rfx3. According to Beckers et al. (2007), this 
genetic cascade is important for the expression 
of the multiple proteins required for cilia forma¬ 
tion and function. Later, these processes influ¬ 
ence dorsal and ventral axis specification and 
neural tube and spinal cord patterning. 


Clearly, the activation of the nuclear genes 
responsible for basic morphogenetic rearrange¬ 
ments is the prerequisite for notochord forma¬ 
tion and development. The T gene, which was 
first described as the Brachyury mutation in 
mice 80 years ago, is an important participant of 
the events required for the differentiation of the 
notochord and the formation of the mesoderm 
during posterior development. The T protein is 
located in the cell nuclei and acts as a tissue- 
specific transcription factor (Kispert et a l., 
1995). Cloning and sequencing of the T gene 
led to the discovery of the T-box gene family, 
which is characterized by a conserved sequence 
called T-box (Bollag et a/., 1994). This ancient 
family of transcription factors, which underwent 
duplication around 400 million years ago, is 
common to all vertebrates (Ruvinsky and Silver, 
1997). There are indications that several murine 
T-box genes are essential for the formation of 
different mesodermal cell subpopulations, and 
one of the T-box genes is essential for the devel¬ 
opment of early endoderm occurring during 
gastrulation (Fapaioannou, 1997). Involvement 
of the T-box genes Tbx2-Tbx5 in vertebrate 
limb specification and development has also 
been demonstrated (Gibson-Brown et al ., 1998). 
Formation of the notochord leads to several key 
ontogenetic events, including induction of the 
neural tube and then the central nervous system. 
A putative morphogen, Shh, secreted by the 
floor plate and notochord, specifies the fate of 
multiple cell types in the ventral aspect of the 
vertebrate nervous system and motoneurons. 
Shh, in turn, induces expression of the onco¬ 
gene Glil, which affects later development of 
the dorsal midbrain and hindbrain (Hynes et al., 
1997). Expression of the SHH gene is also 
important for establishment of the ventral pole 
of the embryonic dorsal-ventral axis (Echelard 
et al., 1993). Unlike the notochord cells, other 
emerging mesodermal cells spread out more or 
less uniformly and give rise to numerous organs 
and structures. 


Establishment of axial identity 

The most recent review on the development 
of axial polarity indicates difficulties in estab¬ 
lishing the initial causes of polarity in the 
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very early stages of mammalian development 
(Johnson, 2009). It is obvious, however, that 
the early blastocyst, and possibly even the 
16-cell stage, have some degree of polariza¬ 
tion and emergent axial identity. Several 
genes that significantly contribute to the 
emerging polarity have been identified so far. 
Genes encoding ezrin, the PAR family 
proteins and CDX2 are probably the key 
regulators. Other proteins, such as CDC42, 
E-cadherin, p-catenin and Hippo, are strongly 
involved in the process, and laminin and 
integrins play some role (Johnson, 2009). 
Development of the primitive streak and the 
notochord is the convincing demonstration 
that both anterior-posterior (AP) and dorsal- 
ventral (DV) axes are strictly determined. 

The left-right (LR) axis may look like an 
automatic consequence from the earlier- 
defined AP and DV axes, as it is perpendicu¬ 
lar to both (Levin, 2004). While the AP and 
DV axes can be influenced by gravity or 
sperm entry point, there are no known fac¬ 
tors that differentiate left from right. If this is 
so, it may mean that the LR symmetry exist¬ 
ing in mammals is effectively caused by the 
AP and DV axes. However, the cause of LR 
asymmetry in vertebrates, and in mammals 
in particular, is a difficult question. Levin 
(2004) compiled a long list of genes which 
may affect the symmetry. More recent find¬ 
ings show that in the developing mouse 
embryo, leftward fluid flow on the ventral 
side of Hensen’s node determines LR asym¬ 
metry. Morphological analyses of the node 
cilia demonstrated that the cilia stand not 
perpendicular to the node surface, but tilted 
posteriorly (Nonaka et a/., 2005). This mor¬ 
phological asymmetry can produce leftward 
flow. A genetic cause of LR asymmetries of 
the internal organs in vertebrates is steadily 
becoming clearer. Gros et al. (2009) consid¬ 
ered two possibilities. The first is that initial 
asymmetric cell rearrangements in chick 
embryos create a leftward movement of cells 
around the Hensen’s node; this is relevant to 
expression of Shh and Fgf8 (fibroblast 
growth factor 8). The alternative is a passive 
effect of cell movements. It has also been 
shown that a Nodal-BMP signalling cascade 
drives LR heart morphogenesis by regulating 
the speed and direction of cardiomyocyte 


movement (Medeiros de Campos-Baptista 
et al., 2008). 


Trophoblast gene expression 

In the pig, the TE makes up the majority of the 
external cells of the blastocyst, and this portion 
of the conceptus expands dramatically and elon¬ 
gates along the villous folds of the uterus. The 
TE, therefore, provides the functional point of 
fetal and maternal contact, which is known as 
the trophoblast. Differentiation of the tropho¬ 
blast begins early in embryonic development and 
ultimately results in functionally diverse cells. 
Again, much of the work on trophoblast gene 
expression has been done in the murine model, 
whose placental tissues and placentation are 
quite different from those of the pig. Roberts 
et al. (2004) describe this process, and consider 
that a key first step in trophoblast differentiation 
is the down-regulation of OCT4, which normally 
acts as a negative regulator of genes required for 
further differentiation (de Vries et al., 2008). 
OCT 4 acts in the pluriplotent ICM to silence 
genes related to differentiation but, once this 
restraint is removed, these genes, discussed 
below, can come under the control of transcrip¬ 
tional activators. Knofler et al. (2001) provide a 
comprehensive overview of the key regulatory 
factors involved in trophoblast development and 
differentiation. Of these, the T-box gene Eomes, 
which is considered to be one of the earliest tro- 
phoblast-determining factors in the pre¬ 
implantation embryo, seems to be required for 
trophoblast differentiation (Russ et al., 2000). 
Both Eomes and the homeodomain protein 
CDX2 are absent in the ICM, but are present in 
the TE (Beck et al., 1995). Cdx2 and Eomes 
murine knockout embryos fail to implant, and only 
develop to the blastocyst stage (Chawengsak- 
sophak et al., 1997; Russ et al., 2000). 

A family of transcription factors of basic 
helix-loop-helix (bHLH) proteins have been 
shown to be important in trophoblast develop¬ 
ment. In the mouse, this includes Mash-2 (Ascl2), 
whose expression is crucial in the specification of 
the trophoblast lineage, and particularly in 
spongiotrophoblast development. The transcrip¬ 
tion factor encoded by the Hxt gene is also 
expressed in the trophoblast, and is considered 
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to have a positive role in promoting the forma¬ 
tion of trophoblast giant cells in the mouse (Cross 
et ah, 1995). This family of factors also includes 
Hand 1, which is important for trophoblast giant 
cell formation in the mouse. Mice lacking the 
Handl gene show defects in the development 
of these cells (Riley et a/., 1998); Handl expres¬ 
sion may also be related to the regulation of 
Mash-2 (Ascl2) (Scott et ah, 2000). Although 
ruminants do develop giant binucleate cells in 
their placentas, structures equivalent to spongio- 
trophoblast or trophoblast giant cells do not 
appear to exist in the pig. 

There are various transcription factors that 
are widely expressed in embryonic, fetal and adult 
tissues that, based on knockout studies, seem to 
be necessary for placental development, as their 
deletion is consistently associated with trophob¬ 
last abnormalities. These include ETS2 (Yamamoto 
et ah, 1998), API (Schorpp-Kistner et ah, 1999; 
Schreiber et ah, 2000) and AP2gamma (Auman 
et ah, 2002; Werling and Schorle, 2002). Schultz 
et ah (1997) also described the genetic determi¬ 
nation of integrin trafficking, which regulates 
adhesion to fibronectin during differentiation of 
the mouse peri-implantation blastocyst. In addi¬ 
tion, the regulation of several metalloproteinases 
and their corresponding genes may also shed 
additional light on the process of implantation 
and further trophoblast development (Bass et ah, 
1997). As the trophoblast forms and matures, it 
eventually produces a series of products and hor¬ 
mones, including, but not limited to, growth fac¬ 
tors, interferons and pregnancy-associated 
glycoproteins, which will be discussed later. Given 
that placental development in rodents is dramati¬ 
cally different from that of the pig, it is important 
to determine whether these murine trophoblast 
genes are relevant to porcine trophoblast 
development. 


Genetic Regulation of Implantation, 
Maternal Recognition of Pregnancy 

and Placentation 

Implantation and placental development 

As with other ungulates, the porcine conceptus 
remains unattached in the uterine lumen for a 
relatively extended period, and is considered to 


associate with the uterine endometrium as of 
Dll-12 of development, with attachment 
being complete by D15 (Geisert et a/., 1982). 
During this initial period of free movement, the 
developing embryos are uniformly distributed 
in the two uterine horns. Also, the embryo 
produces significant amounts of oestrogens 
and interferons during this pre-implantation 
period. Unlike mice or humans, the porcine 
placenta is not highly invasive and relies on the 
elongated conceptus to establish maximal sur¬ 
face contact with the uterine endometrium. 
Formation of the extra-embryonic membranes 
is an obligatory step in establishing the ability 
of the conceptus to attach to and interact with 
the uterus. The membranes that originate from 
the primary germ layers include the yolk sac, 
chorion (serosa), amnion and allantois. The 
yolk sac is formed from the ICM, which evagi- 
nates to create a cavity in the developing 
embryonic gut, which becomes the sac. This 
transient structure is visible shortly after gastru- 
lation and regresses in size as the conceptus 
develops. The amnion and chorion both form 
the primitive endoderm and mesoderm. The 
amnion is a fluid-filled membrane that sur¬ 
rounds the developing embryo, while the chor¬ 
ion is the outermost extra-embryonic 
membrane, which will ultimately interact with 
the uterine endometrium. The first indication of 
the amnion is in the form of the amnionic folds, 
which are evident shortly after the primitive- 
streak stage; it then quickly develops into a fluid- 
filled membrane that encases the developing 
embryo, and amnion formation is complete by 
D18 of development (Friess et ah, 1980). As 
the amnion is developing, the allantois emerges 
as a sac-like evagination from the primitive gut. 
As the embryo grows, the allantois fills and 
eventually contacts the chorion, usually by D19 
in the pig. Continued and increasing contact 
between the chorion and allantois, as well as a 
period of rapid angiogenesis, results in the 
fusion of these two membranes and infiltration 
of the chorion by allantoic vessels. By D30, the 
chorion is extensively vascularized by allantoic 
vessels, resulting in the formation of the 
allantochorion membrane. At this point the 
allantochorion does not truly implant, but inter¬ 
acts with the endometrial glands. Therefore, 
the porcine placenta is classified as a diffuse 
epithelial-chorial type with numerous folds that 
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interlock into the endometrial folds, and six 
layers of tissue between the maternal and fetal 
tissues. As previously mentioned, this D20-30 
period is critical in pigs, as insufficient placen¬ 
tal development during this period influences 
subsequent fetal growth and survival in the 
uterine environment. The placenta continues 
to grow until D60-70, after which its growth 
rate decreases (Freking et a/., 2007). By D85, 
the placenta folds deepen and become more 
complex and, once the placenta is fully estab¬ 
lished, both fetal and maternal mitogenic and 
morphogenic factors continue to support 
development until parturition. 


Maternal recognition of pregnancy 

The mechanisms involved in the maternal 
recognition of pregnancy in pigs were first 
described by Bazer and Thatcher (1977). The 
underlying process in this event is the produc¬ 
tion of FGF2 alpha by the uterine endometrium; 
this is luteolytic and will, therefore, cause 
regression of the progesterone-producing CL. 
In order to maintain the CL and the progester¬ 
one levels necessary to support pregnancy, 
the conceptus will produce oestrogens that are 
anti-luteolytic. Therefore, it is believed that the 
oestrogens produced by the conceptus 
between Dll and D12 provide the signal for 
maternal recognition of pregnancy in swine. 
A second period of oestrogen production also 
occurs between D15 and D30, and is consid¬ 
ered important for CL maintenance. As men¬ 
tioned above, developmental expression of 
the key genes involved in steroidogenesis have 
been studied in the pig, and the filamentous 
conceptus shows elevated expression of 

CYP11A1, CYP19A, STAR, MGST1 and 
SOD1 (Blomberg and Zuelke, 2005). These 
oestrogens influence the expression of recep¬ 
tors for maternal hormones (e.g. prolactin) in 
the uterine tissues which, in turn, alter the 
secretion of FGF2 (Young et a/., 1989, 1990). 
Although it is well established that oestrogen 
secretion is essential for the maternal recogni¬ 
tion of pregnancy in pigs, other secretory 
proteins from the conceptus and endometrium 
are important for the appropriate develop¬ 
ment and survival. For a comprehensive review 


of conceptus-uterus interactions in the pig see 
Johnson et al. (2009). 


Cytokines and implantation 

As described in a recent review by Bazer et al. 
(2009), there are numerous strategies for 
implantation in mammals. Uterine receptivity 
to implantation involves the expression of 
interferon (IFN)-stimulated genes (ISGs), and 
these genes have many roles. A comprehen¬ 
sive list of ISGs is available from the Interferon 
Stimulated Gene Database (http://www.lerner. 
ccf.org/labs/williams/xchip-html.cgi). IFNs are 
pro-inflammatory cytokines secreted in the 
uterus during early pregnancy. They are cell¬ 
signalling proteins and, in most cases, the 
action of IFNs on ISGs is preceded by induc¬ 
tion of the genes by progesterone, which is 
elevated during pregnancy. This process has 
been well described in several species (Bazer et 
al., 2008). In the pig, the conceptus is able to 
secrete two major types of IFN: Type I IFN 
delta (SPI-IFN) and Type II IFN gamma (IFNG). 
Between D13 and D20 of development, SPI- 
IFN and IFNG transcripts are detectable 
(Cencic and La Bonnardiere, 2002; Joyce 
et al., 2007a). In ruminants, IFN tau (IFNT) is 
well established as the factor expressed by the 
conceptus that acts as the signal for recogni¬ 
tion of pregnancy, and therefore impedes 
regression of the CL until the end of preg¬ 
nancy. In the pig conceptus, IFNs do not show 
anti-luteolytic effects that alter the length of the 
oestrus cycle or the concentration of circulat¬ 
ing progesterone (Cencic and La Bonnardiere, 
2002; Cencic et al., 2003). However, beyond 
the initial priming effects of progesterone, 
interactions of IFNs and oestrogen regulate 
cell-specific gene expression of various genes 
in the endometrium, and orchestrate the inter¬ 
actions between the conceptus and the uterine 
environment. Several genes are induced by 
oestrogen in the uterine luminal epithelium by 
D12, including IFN regulatory factor 2 (IRF2). 
IRF2 is a regulator of transcription for various 
IFN-influenced genes that are likely to play 
roles in the establishment of pregnancy, includ¬ 
ing cellular proliferation, attachment and devel¬ 
opment of the conceptus (Ka et al., 2001; 
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Hicks et ai, 2003; White et a/., 2005; Joyce 
et a/., 2007a). In pigs and sheep, IRF2 expres¬ 
sion may restrict expression of most IFN- 
stimulated genes to the endometrial stroma 
and glandular epithelium. By D15, IFNs up- 
regulate a large array of genes in the uterine 
stroma and glandular epithelium, including: IFN 
stimulated gene 15 (ISG15); the interferon- 
related developmental regulator 1 gene (IRF1); 
the signal transducer and activator of transcrip¬ 
tion gene (STAT1); the swine leucocyte anti¬ 
gen genes (B1PJV4 and Q9TSW4); and the 
beta 2 microglobulin gene (B2MG) (Hicks et 
a/., 2003; Joyce et a/., 2007a,b, 2008). The 
role of these pregnancy-specific IFN-stimulated 
uterine genes is unclear, but it could involve: (i) 
protecting the conceptus from immune rejec¬ 
tion; (ii) limiting the ability of the conceptus to 
invade the endometrium; and (iii) stimulating 
uterine/placental angiogenesis. 

There are various other cytokines whose 
mRNA expression has been detected in con¬ 
ceptus and placental tissues, including: inter- 
leukin-6 (IL-6) (Mathialagan et a/., 1992), 
interleukin-1 beta (IL-1B) (Tuo et ai, 1996), 
colony stimulating factor-I (CSF-I) (Tuo et ai, 
1995) and leukaemia inhibitory factor (LIF) 
(Anegon et ai, 1994). More recently, the 
expression of a family of small cytokines, 
referred to as chemokines, has been detected 
in the reproductive tissues of pregnant mice, 
humans and pigs (Girard et ai, 1999; Townson 
and Nibbs, 2002; Martinez de la Torre et ai, 
2007; Wessels et ai, 2007). Although various 
functions for these cytokines have been pro¬ 
posed, their role in mice and humans may not 
be the same in the pig, in which the type of 
placentation and degree of invasiveness of 
implantation are quite different. Therefore, the 
relevance of these pro-inflammatory cytokines 
to the success or failure of pregnancy in the 
mouse and human may be very different from 
that in the pig (Croy et ai, 2009). 


Insulin-like growth factors 

Insulin-like growth factors (IGFs) are pleiotropic 
growth factors that are required for uterine and 
conceptus growth and development (Simmen 
et ai, 1990). Insulin-like growth factors 1 and 


2 (IGF1 and IGF2) are small polypeptides that 
promote cellular differentiation, proliferation 
and migration, and inhibit apoptosis. These 
processes are involved in the extensive remod¬ 
elling that occurs during the development of 
the placenta and its endometrial attachment 
site. The IGFs bind with high affinity to IGF 
receptors, namely IGF1R and IGF2R. IGF1R is 
a member of the tyrosine kinase family and is 
structurally related to the insulin receptor 
(Jones and Clemmons, 1995). IGF1R binds 
with equal affinity to both IGF1 and IGF2, 
whereas IGF2R binds only IGF2 with high 
affinity (Poliak, 2008). The bioavailability and 
biological actions of IGFs are regulated by at 
least six insulin-like growth factor binding pro¬ 
teins (IGFBPs) (Clemmons, 1997), which may 
augment or inhibit IGF action, and several pro¬ 
teases cleave IGFBPs, reducing or eliminating 
their ability to bind IGFs. 

The IGF system has been extensively stud¬ 
ied during the porcine oestrus cycle and in 
early pregnancy (Simmen et ai, 1992; 
Ashworth etai, 2005). IGF1 mRNA decreases 
while IGF2 mRNA increases as pregnancy 
advances, with the highest level found in the 
placenta, followed by the endometrium and 
myometrium (Simmen et ai, 1992). There is 
a spatio-temporal relationship between 
increased uterine IGF1 and IGF2 and oestro¬ 
gen synthesis in the elongating conceptuses at 
DIO-13 (Letcher et ai, 1989; Geisert et ai, 
2001). Increased uterine IGF is associated with 
the expression of CYP11A1 and CYP19A by 
the conceptus, and therefore by oestrogen 
synthesis (Green et ai, 1995). Ashworth et ai 
(2005) documented premature loss of IGFs 
during the period of conceptus elongation 
following early exposure of pregnant gilts to 
oestrogen. The loss of IGFs during this period 
was attributed to proteolysis of IGFBPs due to 
endocrine disruption (Ashworth et ai, 2005). 
Recent research has been conducted to address 
whether IGF1 and IGF2 are directly linked 
with porcine conceptus development at D20 
and D50 (Croy et ai, 2009) in which expres¬ 
sion of the IGF1 and IGF2 genes was assessed. 
At D50, trophoblasts had more IGF transcripts 
than D50 endometrial samples, while, in D20 
endometrial samples, IGF1 transcripts was 
more abundant than in D50 endometrium. 
These data are consistent with other reports 
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that IGF1 expression declines as pregnancy 
advances (Simmen et a/., 1992; Ashworth 
et a/., 2005). IGF2 transcripts were tenfold 
higher in both endometrial and trophoblast 
samples than IGF1 transcripts. These results 
are in agreement with previous reports that 
IGF2 is predominantly expressed over IGF1 
during porcine pregnancy (Ashworth et a/., 
2005). This is also consistent with a series of 
gene knockout studies in mice that clearly 
show that IGF2 rather than IGF1 plays impor¬ 
tant roles in mouse placental and fetal devel¬ 
opment (Baker et a/., 1993; Liu et a/., 1993). 
Given that IGF2 and its receptor gene (IGF2R) 
are both imprinted genes, it has been shown 
that only disruption of the paternal IGF2 and/ 
or the maternal IGF2R genes will affect embry¬ 
onic growth and development (DeChiara 
et a/., 1990; Filson et a/., 1993). 


Angiogenesis 

Angiogenesis, the process by which new blood 
vessels are generated from an existing vascular 
system, occurs extensively during pregnancy to 
support the conceptus. A variety of factors 
support angiogenesis, but vascular endothelial 
growth factor (VEGF) and it receptors (VEGFRI, 
VEGFRII, VEGFRIII) appear to be of primary 
importance (Loges et a/., 2009). Of these 
receptors, VEGFRII is dominant, and placenta 
growth factor (PGF) is a high-affinity ligand for 
VEGFRI (Carmeliet et a/., 2001). Angiogenic 
gene profiles have been studied for VEGF, 
PCF, VEGFRI and VEGFRII in porcine 
endometrial tissues, and in the trophoblast 
associated with the healthy conceptus and with 
conceptuses experiencing arrested develop¬ 
ment (Tayade et a/., 2006; Linton et a/., 2008). 
At D20 and D50 of development, fewer VEGF 
transcripts were detected in the endometrial 
tissues associated with the arrested concep¬ 
tuses compared with the healthy ones, but the 
amount of VEGF transcripts in the trophoblast 
were not different. Also, from D15 to D28 of 
porcine development, there is dramatic onset 
of angiogenic activity that coincides with ele¬ 
vated numbers of a unique lymphocyte type 
referred to as uterine natural killer cells (uNK) 
(Engelhardt et aL, 2002). When endometrial 


lymphocytes from these same D20 and D50 
conceptuses were screened for angiogenic 
gene expression, the endometrial lymphocytes 
were found to have a greater abundance of the 
VEGF transcripts than the endometrial 
endothelium or the trophoblasts. However, for 
the conceptuses demonstrating arrested devel¬ 
opment, their attachment sites show severely 
reduced VEGF expression and an increase in 
PGF expression by the lymphocytes. The uter¬ 
ine lymphocytes preferentially expressed 
VEGFRI, while the trophoblasts were abun¬ 
dant in VEGFII transcripts, indicating that 
mechanisms that regulate angiogenesis differ 
between the maternal and embryonic/fetal 
compartments (Tayade et aL, 2007). This 
molecular evaluation of the porcine conceptus 
attachment sites shows a clear role for immune 
cells in the acceleration of angiogenesis in 
these tissues. 


Pregnancy-associated glycoproteins 

(PAGs) 

The pregnancy-associated glycoproteins (PAGs) 
belong to a group of aspartic proteinases that 
include pepsins (gastric enzymes) and cathep- 
sins D and E (lysosomal enzymes) (Davies, 
1990). The production of PAGs and PAG-like 
factors has been identified in various species 
during pregnancy, and their expression con¬ 
sistently initiates at the time of implantation 
and continues in the TE as pregnancy proceeds 
(Szafranska et aL, 1995). In ruminants, the 
expression of several bovine PAG [BPAG) tran¬ 
scripts has been identified in binuclear cells, 
while other distinct BP AG transcripts are 
expressed throughout the TE (Green et aL, 
2000). Porcine PAGs have been localized in 
the chorionic tissues (Majewska et aL, 2006). 
To date, at least 66 distinct PAG complemen¬ 
tary DNAs (cDNAs) have been cloned from 
various species (Majewska et aL, 2009), but 
only eight pig PAG genes (PPAG) have been 
identified (PPAG1-6, PPAG8 and PPAG10) 
(Szafranska and Panasiewicz, 2002; 
Panasiewicz et aL, 2004; Szafranska et aL, 
2006). The PPAGs are classified into two 
subfamilies: PPAG1- like (PPAG1,3 and 5) and 
PPAG2-like (PPAG2, 4, 6, 8 and 10), based on 



Developmental Genetics 


285 


their nucleotide and amino acid sequence 
homology/identity. Majewska et a/. (2009) 
have recently reported the chromosomal 
assignment of the PFAG gene family to the 
long arm of porcine chromosome 1. Although 
the definitive role of PAGs has yet to be clari¬ 
fied, Roberts et ai (1996) suggested that they 
may interfere with maternal immune function 
and thereby protect the conceptus. In vitro 
studies have revealed a potential role of the 
PAG family as chorionic signalling ligands that 
interact with gonadotrophin receptors in cyclic 
pigs and cows (Szafranska et a/., 2007), as 
well as during the early pregnancy of the pig 
(Panasiewicz et a/., 2007). However, the over¬ 
all role of PAGs is still under investigation in 
several species. These secretory proteins are 
easily detectable in maternal blood circulation 
and are used for pregnancy diagnosis (see 
Szafranska et a/., 2006). 


Genetic Control of Post-implantation 

Development 

The three germ layers and their derivates 

As mentioned earlier, by the end of gastrula- 
tion, three germ layers are established: endo- 
derm, mesoderm and ectoderm. Embryonic 
and postnatal derivates of each of these layers 
were described and generally understood many 
decades ago (Patten, 1948). Molecular genetic 
mechanisms driving this highly complex com¬ 
bination of processes began to emerge rela¬ 
tively recently. Zorn and Wells (2009) published 
one of the first reviews covering the entire 
process of endoderm development and organ 
formation. Here we can highlight only the 
major regulatory systems influencing the vari¬ 
ety of genes and processes involved in endo¬ 
derm morphogenesis and the formation of 
certain organs. The Nodal signalling pathway 
is necessary and sufficient to initiate ectoderm 
and mesoderm development, and is itself influ¬ 
enced by the canonic WNT/p-catenin pathway 
(Zorn and Wells, 2009). High-level Nodal sig¬ 
nalling supports endoderm development and 
lower activity specifies mesoderm identity. The 
activity of the Nodal pathway is controlled by 
an auto-regulatory loop. Several genes involved 


in the pathway, such as Nodal , xnrl and sqt , 
have conserved Foxhl DNA-binding sites in 
their first introns, sustaining the high activity 
essential for endoderm development. In con¬ 
trast, in developing ectoderm, a negative feed¬ 
back of Nodal activity is caused via the 
transcriptional target Lefty (Shen, 2007). Soon 
after gastrulation, the endoderm germ layer 
forms a primitive gut tube, which leads to 
organ specification, then to the formation of 
organ buds and finally to more specialized cell 
lineages. In some cases, as in hepatocyte and 
p-cell differentiation, the whole cascade of 
events and the genes involved were uncovered 
(Zorn and Wells, 2009). 

Developmental events in mesoderm and 
ectoderm progress simultaneously but inde¬ 
pendently, with significant interactions 
between derivates from the germ layers. As is 
well known, many organs have cellular com¬ 
ponents originating from different germ lay¬ 
ers. Certain genes play a key role in the earliest 
stages of germ layer development. For instance 
the Eed gene, initially identified in mice, is 
critical for embryonic ectoderm development 
(Sharan et a/., 1991), as deletion of this gene 
prevents formation of ectoderm. A highly 
homologous gene was found in humans, and 
there are few doubts that a porcine homologue 
will be identified. The murine Brachyury (T) 
gene is crucial for mesoderm development. 
Mice homozygous for mutant alleles of the T 
gene do not generate enough mesoderm, and 
show severe disruption in morphogenesis of 
mesoderm-derived structures, in particular of 
the notochord (Wilkinson et a/., 1990). One 
of the T-box genes, Tbx6, in mice is impli¬ 
cated in development of paraxial mesoderm 
(Chapman et a/., 1996). Tbx6 transcripts are 
first detected in the gastrulation-stage embryo 
in the primitive streak and in newly recruited 
paraxial mesoderm. 


Development of segment identity 

and HOX genes 

Segmentation, observed in different groups of 
animals and particularly in vertebrates, has 
deep evolutionary roots. Segments with a com¬ 
mon origin remain relatively separate during 
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development to create an opportunity for diver¬ 
sification and specialization. This evolutionary- 
developmental strategy has been commonly 
used for the creation of morphological struc¬ 
tures or groups of cells with distinct features. 
For instance, the development of two major 
structures, the ectodermal neural tube and the 
paraxial mesoderm, depends on segmentation: 
the first is critical for development of the hind¬ 
brain, the head process and the spinal cord; the 
second is essential in generation of somites, 
which give rise to the axial skeleton and skeletal 
muscles. In developing porcine embryos, visible 
features of segmentation are apparent at D14 
(Fatten, 1948). The first somites can be seen in 
the middle of the closing neural groove, and 
their number increases anteriorly. The genetic 
and cellular processes driving segmentation 
depend on the expression patterns of HOX 
genes (Alexander et a/., 2009). 

The homeotic genes, which encode helix- 
loop-helix transcription factors, were first 
described in Drosophila as the primary deter¬ 
minants of segment identity. They all contain a 
similar 180-bp DNA sequence motif named 
the homeobox. Comparative analysis of the 
Drosophila homeotic gene complex, called 
HOM-C, and the mammalian homeobox 
genes, called the HOX complex, demonstrates 
a striking case of evolutionary conservation. 
The HOX gene family determines a set of tran¬ 
scription factors crucial for the development of 
axial identity in a wide range of animal species 
(Maconochie et ah, 1996). Figure 12.7 shows 
the striking similarity and collinearity found in 
the molecular anatomy of the insect and mam¬ 
malian HOX complexes. The main difference 
is the number of complexes per genome. In 
insects there is only one, while mammals and 
other higher vertebrates have four separate 
chromosome clusters (Alexander et ah, 2009). 
There are 39 HOX genes in mammalian 
genomes, which belong to 13 paralogous 
groups. 

The HOX genes are expressed in segmen¬ 
tal fashion in the developing somites and cen¬ 
tral nervous system, and each HOX gene acts 
from a particular anterior limit in a posterior 
direction. The anterior and posterior limits are 
distinct for different Hox genes (Fig. 12.7). 
A hallmark of HOX genes is the correlation 
between their linear arrangement along the 


chromosome and the timing and AP limits of 
expression during development (Alexander 
et ah, 2009). HOX genes determine AP posi¬ 
tional identity within the paraxial and lateral 
mesoderm, neurectoderm, neural crest and 
endoderm. 

Thus, the vertebrate body plan is, at least 
partially, a result of the interactions of HOX 
genes that provide cells with the essential posi¬ 
tional and functional information. Signals from 
the HOX genes force embryonic cells to 
migrate to the appropriate destination and 
generate certain structures. Major signalling 
pathways such as the fibroblast growth factor 
(Fgf), Wnt and retinoic acid (RA) pathways play 
important roles in affecting the expression of 
different HOX genes in different developmen¬ 
tal conditions. The expression of RA and its 
protein-binding ability, as well as its other func¬ 
tions in development of the porcine conceptus 
have been described (Yelich et ah, 1997). RA 
can affect the expression of HOX genes and 
there is a 5' to 3' gradient in responsiveness of 
the genes to retinoids (Marshall et ah, 1996). 
RA acts via its receptors, which comprise two 
families, RAR and RXR, which are members of 
the ligand-activated nuclear receptor super¬ 
family. The receptors interact to form com¬ 
plexes that, in turn, regulate target gene 
binding to retinoic acid response elements 
(RAREs). These RAREs are found in the 5' 
regulatory regions of the murine Hox genes. 
HOX genes have profound influence on the 
whole array of developmental process and on 
the establishment of segment identity in a vari¬ 
ety of Metazoa. 

Organogenesis 

Hundreds, if not thousands, of genes are 
involved in organogenesis. Here we briefly 
describe only a few of them, which have 
numerous effects on the formation of organs 
during development. The T-box family of tran¬ 
scription factor genes is a good example of 
such massive involvement in organogenesis 
owing to their contribution to embryonic cell- 
fate decisions, the control of extra-embryonic 
structures, embryonic patterning and numer¬ 
ous aspects of organogenesis (Naiche et ah, 
2005). Some of the T-box genes are involved 
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Fig. 12.7. (a) Alignment of the four mouse (listed as vertebrate) Hox gene complexes with the HOM-C 
(ANT-C) gene complex from Drosophila. The vertical shaded boxes indicate related genes. The 13 
paralogous groups are noted at the bottom of the alignment. The collinear properties of the Hox 
complexes with respect to timing of expression, antero-posterior (AP) level and retinoic acid (RA) 
response are also noted at the bottom. From Maconochie et al. (1996), with the authors’ permission. 

(b) Summary of HOM-C and Hox-2 expression patterns. The upper part of the figure is a diagram of 
a 10-h Drosophila embryo with projections of the expression patterns of the different genes from 
the HOM-C gene complex to particular body segments.The lower part of the figure is a diagram of 
al 2-day mouse embryo with projections of expression patterns of different genes from the Hox-2 
complex to particular body segments. From McGinnis and Krumlauf (1992), with the authors’ 
permission, int, intercalary; mx, maxillary; la, labial. Changes in genetic nomenclature made after the 
initial publication of this figure were not introduced here. 
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in limb morphogenesis and specification of 
forelimb/hindlimb identity. It has been shown 
that Tbx5 and Tbx4 expression is primarily 
restricted to the developing forelimb and hind- 
limb buds, respectively. These two genes appear 
to have been divergently selected in vertebrate 
evolution to play a role in the differential speci¬ 
fication of fore- (pectoral) versus hind- (pelvic) 
limb identity (Gibson-Brown et a/., 1998). 
Mutations in the human TBX3 gene cause the 
ulnar-mammary syndrome, which is character¬ 
ized by limb deficiencies or duplications, mam¬ 
mary gland dysfunction and genital abnormalities. 
It was suggested that TBX3 and TBX5 evolved 
from a common ancestral gene and each has 
acquired specific yet complementary roles in 
patterninq the mammalian upper limb (Bamshad 

et a/., 1997). 

At least seven T-box genes, including 
Tbxl, Tbx2 , Tbx3, Tbx5 , Tbxl8 and Tbx20, 
are involved in heart development. Each T-box 
gene has a unique expression profile in specific 
heart regions (Naiche et a/., 2005). The study 
of transcriptional activation and repression of 
several T-box genes in three-dimensional space 
provides unique knowledge about the cardiac 
progenitor fields, as well as about the develop¬ 
ment of heart form, function and pathology 
(Stennard and Harvey, 2005). 

Pax genes are another family of develop¬ 
mental genes coding nuclear transcription 
factors. They contain the paired domain, a 
conserved amino acid motif with DNA-binding 
activity. Pax genes regulate the development 
of organs and structures such as the kidney, 
eye, ear, nose, limb muscles, vertebral column 
and brain. Vertebrate Pax genes are involved 
in pattern formation, possibly by determining 
the time and place of organ initiation or mor¬ 
phogenesis (Dahl et a/., 1997). Paxl, for 
instance, is a mediator of notochord signals 
during the dorsoventral specification of verte¬ 
brae (Koseki et a/., 1993). The Pax3 gene 
may mediate activation of MyoD and My/5, 
the myogenic regulatory factors, in response 
to muscle-inducing signals from either axial tis¬ 
sues or overlying ectoderm, and may act as a 
regulator of somitic myogenesis (Maroto et a/., 
1997). The Pax2 gene is involved in optic 
nerve formation and Pax6 is considered as a 
master gene for eye development as well as 
for the development of some other ectodermic 


tissues. Mutations in Pax6 result in eye mal¬ 
formation, known as aniridia in humans and 
small eye syndrome in mice (Dahl et a/., 
1997). The eyes absent gene (Eya2), which is 
involved in eye development in several meta¬ 
zoan phyla, seems to be relevant to pig devel¬ 
opment. Like the Pax6 gene family, Eya2 was 
probably recruited for visual system formation 
some considerable time after its original func¬ 
tion was established (Duncan et a/., 1997). 
Several other genes, such as Bmp4, Msxl and 
Msx2, which encode bone morphogenetic 
proteins and are expressed before and after 
neural tube closure, interact with Pax2 and 
Pax3 (Monsoro-Burq et a/., 1996). The 
expression of Pax 7 in the cells harvested from 
porcine embryos confirmed that the gene is 
involved in the development of myogenic sat¬ 
ellite cells (Singh et a/., 2007). Pax8 is essen¬ 
tial for the regulation of thyroid function (Kang 
et a/., 2001). 


Gene regulation of muscle development 

Obviously, information on the development of 
muscle tissue in the pig has great practical 
importance. The genetic mechanisms of mus¬ 
cle development in mammals are complex 
(Firulli and Olson, 1997). Skeletal, cardiac and 
smooth muscle cells express overlapping sets 
of muscle-specific genes, although some genes 
are only expressed in one particular muscle 
type. So-called genetic modules or independ¬ 
ent cis-regulatory regions are required to direct 
the complete developmental pattern of expres¬ 
sion of individual muscle-specific genes within 
each muscle type, and the temporo-spatial 
specificity of these regulatory modules is estab¬ 
lished by unique combinations of transcription 
factors (Firulli and Olson, 1997). 

The musculoskeletal system of the trunk and 
tail develops from the paraxial pre-somitic 
mesoderm (FSM) cells (Tam and Beddington, 
1987). Once these cells reach a specified position, 
gene expression changes significantly and a seg¬ 
mentation process begins. In mice, new somites 
appear approximately every 2h, and they are 
separated from the anterior PSM (Dunty et a/., 
2008). The bHLH transcription factor encoded by 
the Mesp2 gene and controlled by the Notch 
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signalling pathway is essential in the segmenta¬ 
tion programme (Saga et a/., 1997). Ripp\\)2 
is another identified gene involved in segment 
boundary regulation and this is also under the 
influence of Notch pathway genes, as well as 
the mesodermal transcription factors T and 
Tbx6 (Oginuma et a/., 2008). 

Somitogenesis is probably controlled by a 
segmentation clock, which consists of molecular 
oscillators in the Wnt3a, Fgf8 and Notch path¬ 
ways (Pourquie, 2003; Aulehla and Herrmann, 
2004; Rida et a/., 2004). Alternatively directed 
gradients of fibroblast growth factor 8 (Fgf8) 
and/or Wnt3a and retinoic acid (RA) establish a 
boundary front in the anterior PSM. Dunty et 
al. (2008) demonstrated that the canonical 
Wnt3a/p-catenin pathway is necessary for 
molecular oscillations in all three signalling path¬ 
ways, but does not function as an integral com¬ 
ponent of the oscillator. On the contrary, Notch 
pathway genes continue to oscillate in the pres¬ 
ence of stabilized p-catenin and thus drive peri¬ 
odic expression of the target genes lunatic fringe 
(Lfng) and Hes7 (Bessho et al. , 2001; Morimoto 
et a/., 2005). Further investigations are neces¬ 
sary in order to reach a deeper understanding of 
this sophisticated dynamic system. 

It is well established that mitogens inhibit 
the differentiation of skeletal muscle cells, but 
the IGFs, acting through a single receptor, 
stimulate both the proliferation and differenti¬ 
ation of myoblasts. For example, an inhibitor 
of mitogen-activated protein (MAP) kinase 
inhibits IGF-stimulated proliferation of L6A1 
myoblasts and associated events, such as 
phosphorylation of the MAP kinases and ele¬ 
vation of c-fos mRNA and cyclin D protein. 
This inhibitor caused a dramatic enhancement 
of differentiation, evident at both a morpho¬ 
logical and biochemical level. In sharp con¬ 
trast, an inhibitor of phosphatidylinositol 
3-kinase and p70 S6 kinase completely abol¬ 
ished IGF stimulation of L6A1 differentiation. 
These data demonstrate that the MAP kinase 
pathway plays a primary role in the mitogenic 
response and is inhibitory to the myogenic 
response in L6A1 myoblasts, while activation 
of the phosphatidylinositol 3-kinase/p70 (S6k) 
pathway is essential for IGF-stimulated differ¬ 
entiation. Thus, it appears that signalling 
from the IGF-1 receptor utilizes two distinct 
pathways that lead to either the proliferation 


or the differentiation of muscle cells (Coolican 
et a/., 1997). IGF-I and II, their binding pro¬ 
teins and members of the transforming growth 
factor (TGF) superfamily (myostatin and 
TGF(31) are key regulators of the proliferation 
and differentiation of porcine myogenic cells. 
Changes in the relative expression of IGF and 
TGFp play a considerable role in regulating 
myogenesis in porcine embryonic myogenic 
cell (PEMC) cultures (Xi et a/., 2007). 

Members of the B-cell translocation gene 
family with anti-proliferative properties (BTG1 , 
BTG2 and BTG3) are involved in muscle 
development and growth in pigs. The BTG2 
gene is expressed at high levels in skeletal 
muscle and heart. The expression of BTG2 
and BTG3 was significantly different in skele¬ 
tal muscle among different developmental 
stages and between two studied breeds. Both 
genes were induced in differentiated C2C12 
cells, suggesting their involvement in myo¬ 
genic differentiation (Feng et a/., 2007). 
Selection for greater and leaner muscle mass 
in pigs picked up a number of mutations affect¬ 
ing muscle development and functioning (see 
Chapter 15). This includes RYR , RN , MU and 
some other genes. 


The major steps in gonad differentiation 

The earliest steps of gonadal development in 
mammals commence at a similar period in XX 
and XY embryos. Primordial germ cells, which 
differentiate relatively late in mammals, migrate 
into the gonad area of either presumptive sex 
indiscriminately, and may function even across 
species barriers (McLaren 1998, 1999). 

Anderson (2000) described sexual develop¬ 
ment in pig embryos in sufficient detail. To be 
functional, a gonad needs both germ cells and 
somatic cells. It seems that gonadal develop¬ 
ment in the pig does not significantly deviate 
from that in other mammals. A few dozen 
germ cells, originating from the proximal 
region of the embryonic ectoderm, start their 
journey inside the embryo, along with the 
invaginating hindgut. In the pig, primordial 
germ cells migrate from the dorsal mesentery 



290 


M. Dyck and A. Ruvinsky 


of the hindgut (D18) to the primordium of the 
gonad (D23), and enter the genital ridge by 
D26. The number of these cells in the porcine 
embryo increases from D18 to D26 (Takagi 
et a/., 1997). The first differences between 
male and female porcine embryos start to 
emerge at D26. Y chromosome-carrying 
embryos develop testicular cords and intersti- 
tium, and begin testosterone production, while 
undifferentiated gonadal blastema can be seen 
in XX embryos. Further events in males follow 
without delay; Sertoli cells can be observed at 
D27, production of anti-Mullerian hormone 
(AMH) starts at D29, and production of 
3p-HSD (hydroxysteroid dehydrogenase) 
begins at D30-35, leading to the appearance 
of Leydig cells. Mullerian ducts disappear and 
Wolffian ducts are transformed into epididymi¬ 
des, vas deferens, seminal vesicles, the pros¬ 
tate and other structures. From approximately 
D100 of fetal development until D20 of post¬ 
natal development, germ cells show a constant 
rate of doubling (Anderson, 2000). 

In female embryos, egg nests develop by 
D30-35, and meiosis resumes at D40. As 
previously mentioned, at the time of birth the 
population of germ cells is approximately 
400,000. The pre-meiotic diplotene porcine 
oocyte appears by D50, and this process 
continues until about D20 after birth. In 
female embryos, Mullerian ducts are trans¬ 
formed into oviducts, uterus, cervix and the 
upper parts of the vagina, while Wolffian 
ducts disappear (Anderson, 2000). Primordial 
follicles are first observed in ovaries at D68 
and primary follicles at about D75; secondary 
follicles appear near the time of birth. Starting 
from about D70 post coitum until D90 after 
birth, primordial follicles account for about 
80% of the total follicular population (Oxender 

et a/., 1979). 

In mice, expression of the Bmp4 gene 
(bone morphogenetic protein 4) in the TE 
layer, which is in the closest contact with the 
epiblast, is responsible for the differentiation 
of both the primordial germ cells and the 
allantois (Lawson et a/., 1999). If a similar 
mechanism operates in the pig, then BMP4 
protein would be also produced, and thus pre¬ 
cedes cellular migration. Owing to ongoing 
proliferation, a significant number of germ 
cells reach the genital ridge, which consists of 


a thin layer of mesenchymal cells located 
between the coelomic epithelium and the 
mesonephros. Two genes, Sfl and Wtl, are 
particularly important in the development of 
the murine genital ridge (McLaren, 1998). 
Eventually, four different cell lines comprise 
the genital ridge: primordial germ cells, 
somatic steroidogenic cells, supporting cells 
and connective tissue. The fate of each lin¬ 
eage depends on the sexual determination of 
the embryo in which they develop, and the 
patterns of genetic activity are quite different 
in the testes and the ovaries. The porcine 
DDX4 gene is a homologue to the vasa gene 
identified in other mammalian species (Lee 
et a/., 2005). In adult tissues, DDX4 tran¬ 
scription was restricted only to the ovary and 
testis. In porcine fetuses, the transcript was 
found in all stages, except for D17-18. The 
DDX4 protein was observed in proliferating 
primordial germ cells but not in embryonic 
germ cells. 


The genes involved in sex differentiation 

Gonad development is a key element in estab¬ 
lishing mammalian sex. The chromosomal 
constitution determines the migration of cells 
into the gonads and the final differentiation 
into a testis or an ovary (Ffunter, 1995). It is 
well known that sex determination in pigs, as 
well as in many other mammalian species, 
depends on the presence or absence of the Y 
chromosome. Embryos without the Y chromo¬ 
some develop as females, and those that carry 
this chromosome develop as males. The break¬ 
through in molecular understanding of sex 
determination and differentiation in the mouse 
and human (Goodfellow and Lovell-Badge, 
1993) paved the way for that in other mam¬ 
mals, including the pig. 

The study of porcine SRY revealed that the 
open reading frame (ORF) of the gene consists 
of 624bp. The protein has a centrally located 
high mobility group (HMG) box domain as well 
as amino terminal and carboxy terminal regions. 
Structurally, porcine SRY is more similar to 
bovine and human SRY than to mouse Sry. 
The porcine SRY gene is expressed within the 
cells of the genital ridge of the developing male 
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pig embryo between D21 and D26 of gestation. 
At this time the primitive gonads are still bipo¬ 
tential; however, after D31, testis determination 
is evident (Daneau et a/., 1996). Comparative 
analysis of the gene order on the Y chromo¬ 
some between pig, human and mouse showed a 
number of differences. There is greater order 
conservation with the murine Y than with the 
human Y chromosome. The porcine SRY is 
located on the p arm of the Y chromosome 
close to the centromere (Quilter et a/., 2002). 

In genetic males, porcine genital ridge cells 
express not only the Y chromosome-located 
SRY and ZFY genes and the X chromosome- 
located DAX1 (or NR0B1, which encodes a 
nuclear receptor protein) and DFX genes, but 
also the autosomal AMH, WT1 , SOX2 and 
SOX9, SF1 (steroidogenic factor-1) and several 
other genes (Lahbib-Mansais et a/., 1997; Parma 


et a/., 1997). The SF1 protein transactivates the 
porcine SRY promoter (Pilon et a/., 2003). 

Figure 12.8 shows a simplified genetic 
model describing the major events in sex deter¬ 
mination pathways. The activation of SRY in 
normal XY embryos shifts the balance in favour 
of testis development and male pathways 
through up-regulation of the SOX9 gene and 
signalling of FGF9 (fibroblast growth factor 9), 
which promotes the secretion of prostaglandin 

D 2 (PGD 2 ) (Nef and Vassilli, 2009). FGF9 and 
PGD 2 form a positive feedback loop and inten¬ 
sify SOX9 expression, thus directing the dif¬ 
ferentiation of supporting cells to Sertoli cells, 
which in turn down-regulate female signals 
such as WNT4 and FOXL2 (Fookhead tran¬ 
scription factor; Lamba et a/., 2009). Thus, 
testicular development in mammals is triggered 
by SRY. In genetic males, this factor induces 
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Fig. 12.8. A genetic model for sex determination, controlled by a balance of antagonistic pathways. In XY 
gonads, Sry triggers the up-regulation of Sox9, leading to Sertoli cell commitment and testicular 
differentiation. Sertoli cell differentiation is a result of the establishment of a positive feedback loop between 
Sox9 and the secretion of Fgf9 (fibroblast growth factor 9), and also PGD 2 (prostaglandin D 2 ; not shown), 
which act in a paracrine manner to recruit additional Sertoli cells. In XX gonads, two independent signalling 
pathways involving the Rspol /Wnt4/p-catenin pathway and Foxl2 tilt the balance towards the female side 
and silence Sox9 and Fgf9. Arrows indicate stimulation;! bars indicate inhibition. Reproduced from Nef and 
Vassilli (2009), with the kind permission of the authors. 
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the differentiation of Sertoli cells (reviewed by 
McLaren, 1991) and the secretion of AMH. 
AMH, which belongs to the transforming 
growth factor p family, causes regression of 
Mullerian ducts and promotes the development 
of Wolffian ducts and the differentiation of 
Leydig cells, which secrete the male steroid 
hormone, testosterone (Behringer, 1995). 
Testosterone binds to androgen receptors, 
which act as transcription factors. 

In the absence of an SRY gene, which is 
typical for XX embryos, the gonads develop 
into ovaries. Two independent signalling path¬ 
ways, the R-spondinl/WNT4/p-catenin path¬ 
way and FOXL2 transcription factor pathway, 
support this developmental sequence of 
events. R-spondinl has been recently recog¬ 
nized as a key female-determining factor (Nef 
and Vassilli, 2009). In females, no Leydig cells 
are formed, no testosterone is produced and 
gonad development steadily moves towards 
the female phenotype. Quite often the female 
developmental programme is considered as 
the ‘default’, while the male programme 
requires ‘switching on’ of the SRY gene, fol¬ 
lowed by activation of other genes. The com¬ 
parison of four regulatory regions located 
upstream of SRY shows high conservation 
between the human, bovine, pig and goat 
regions. These regions of homology share 
transcription factor-binding sites that appear 
to be subject to strong evolutionary pressure 
for conservation and may, therefore, be 
important for the correct regulation of SRY 
(Ross et a/., 2008). In contrast, the structure 
of the SRY region in the mouse is significantly 
different. 


Cycle of the X chromosome 

As proposed by Lyon (1961), and now uni¬ 
formly accepted, one of the X chromosomes in 
eutherian females undergoes inactivation dur¬ 
ing early embryonic development. Numerous 
investigations shed light on different aspects of 
X chromosome behaviour, including preferen¬ 
tial inactivation of the paternal X chromosome 
in the trophoblast, random inactivation in the 
inner cell mass and molecular mechanisms of 
inactivation (Goto and Monk, 1998). 


These scenarios appear to be completely 
relevant to the cycle of the X chromosome in 
the pig; however, the supporting information 
is still lacking. Preferential inactivation of 
paternal X chromosome in porcine XX 
embryos probably occurs in trophoblastic 
cells around D7.5, and then random inactiva¬ 
tion follows in the embryonic disc cells around 
D11.5. Females become natural mosaics, 
with one X chromosome randomly inacti¬ 
vated in each somatic cell. In post-meiotic 
oocytes, the X chromosome is active, as in 
other mammalian species. The paternal X 
chromosome, on the contrary, enters the 
zygote being inactive, but, soon after fertiliza¬ 
tion, it reactivates. In XX embryos, both X 
chromosomes are expected to be active until 
trophoblast differentiation. Then only one X 
chromosome remains active, regardless of 
the number of X chromosomes in a cell. This 
is an essential condition for gene dose com¬ 
pensation. The mechanisms of silencing the 
inactive X chromosome are complex. Several 
chromatin modifications are necessary in 
order to form stable facultative chromatin 
capable of propagating through numerous 
cell divisions. The so-called X-inactivation 
centre located on the X chromosome con¬ 
tains the Xist gene and cis regulatory genetic 
elements. The Xist gene encodes an RNA 
molecule that plays a key role in silencing the 
inactive X chromosome (Plath et a/., 2002). 
Xist is negatively regulated by its antisense 
transcript, Tsix. It seems, however, that Tsix 
(the reverse spelling of Xist) is not the only 
regulator, and additional transcription factors 
are involved in this complex process (Senner 
and Brockdorff, 2009). It has been demon¬ 
strated that porcine Xist gene expression 
may be affected by maternal metabolic state 
at the time of ovulation (Vinsky et a/., 2007). 
As this study shows, sows who were in a neg¬ 
ative metabolic state during the week before 
ovulation and fertilization not only demon¬ 
strated greater than usual embryonic mortal¬ 
ity by D30, but the mortality of female 
embryos was greater than that of the male 
embryos. This was attributed to aberrant Xist 
expression in female embryos, suggesting 
that maternally influenced epigenetic defects 
may contribute to sex-biased embryonic loss 
in the pig. 
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Anomalies in sex determination 
and differentiation 

As indicated above, the SRY gene plays a criti¬ 
cal role in sex differentiation, and usually only 
embryos carrying the Y chromosome possess 
SRY. However, SRY can become non¬ 
functional or be transferred from the Y chromo¬ 
some to the X chromosome by a rare 
recombination event. Such events can cause 
sex reversal, whereby XY individuals become 
females and XX individuals become males. In 
either case, intersex individuals can arise 
(Cattanach et a/., 1982). In humans, XY sex 
reversals are rather frequent (about 1 in 3000 
newborns) and are genetically heterogeneous. 
XX sex reversals, on the contrary, are rare 
(about 1 in 20,000 newborns) and are usually 
caused by the translocation of SRY. 
Surprisingly, in the majority of human cases 
(-75%) the cause of sex reversal cannot be pre¬ 
cisely identified (Nef and Vassilli, 2009). 

A recent comparative study of several 
mammalian species, including the pig, 
revealed potentially important upstream- 
located SRY regulatory elements, mutations 
of which might lead to XY sex reversal (Ross 
et a/., 2008). It seems, however, that cur¬ 
rently available data do not point towards a 
significant importance of this phenomenon in 
the pig, as the majority of sex reversals found 
so far are mainly SRY-negative individuals 
with a 38 chromosome, XX (38,XX) karyo¬ 
type. A number of pig intersexes with 
ovo-testes were examined. These animals 
were usually infertile without clear spermato- 
genetic or oogenic activity. In some of them, 
the presence of a properly developed uterus 
and ducti deferens was observed, but oviducts 
were not found. In many of these cases, the 
sex-reversal status was likely to have been 
caused by an autosomal recessive mutation 
(Hunter, 1996; Switonski et a/., 2002). In an 
attempt to understand the aetiology of inter¬ 
sexuality in such pigs, the gonads of 38,XX 
(SRY-negative) female to male sex-reversed 
animals were studied during fetal life (50 and 
70 days post coitum (dpc)), after birth (35 days 
post-partum (dpp)) and during adulthood. At 
the fetal stages, an elevated expression of 
SOX9, AMH and the P450 aromatase gene 
(CYP2E1) were observed. In addition to genes 


involved in the testicular pathway, the same 
gonads expressed an ovarian-specific factor 
FOXL2. The genes involved in this pathology 
pathway act early during gonadogenesis and 
affect the ovary-differentiating pathway. This 
process occurs with variable expressivity, 
producing embryos with abnormalities rang¬ 
ing from ovarian germ cell depletion through 
to trans-differentiation into testicular struc¬ 
tures (Pailhoux et a/., 2001). 

The rare identification of XY pig females 
which carry a duplication of a certain X chro¬ 
mosome region (Xp21) led to the hypothesis 
that a double dose of a gene might cause sex 
reversal. The DAX1 gene (NR0B1), was iso¬ 
lated from this region and considered as a can¬ 
didate. In porcine embryonic gonads, DAX1 
expression starts in the urogenital ridges of 
both XX and XY porcine embryos between 
D20 and D22 of development, and continues 
until adulthood (Parma et a/., 1997). A recent 
human study with a similar gonad disorder 
confirmed that indeed the NR0B1 ( DAX1 ) 
gene was duplicated in such patients (Barbaro 
et a/., 2008). 

Aneuploidy and chimerism for sex chro¬ 
mosomes were also described in pigs. 
Depending on karyotype, such individuals may 
widely vary from nearly normal males to nearly 
normal females, with all kinds of intersexes in 
between (see Chapter 7). 


Summary 

The progress in developmental genetics of 
the pig has been significant in recent years, 
particularly in regard to conceptus growth, 
fetal-maternal interactions and placental 
attachment. However, the understanding of 
species-specific features of the earlier stages 
of development up to blastocyst formation 
and hatching has grown more slowly, and 
we have to rely on mammalian developmen¬ 
tal genetics, which is mainly built on mouse 
data. One can anticipate that further 
advancement of molecular genetics methods 
will eventually improve our understanding of 
the gene expression profiles of cells and tis¬ 
sues involved in the critical and complex 
developmental stages in the pig. Through 
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continued gene discovery and functional 
genomics research on the establishment and 
maintenance of pregnancy in the pig, we 
will be able to manage key developmental 
events better, and in doing so improve con- 
ceptus development and survival rate in the 

Pis- 


Note 

In this chapter, mouse genes follow the rules 
for murine genetic nomenclature, which is 
slightly different from that for other mammal¬ 
ian species: namely, only the first letter of a 
gene name is capitalized. 
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Introduction 

Genetic variation in the pig, as in other farm 
animal species, is known to be a resource of 
utmost importance. Pigs, indeed, account for 
about 40% of the world production of meat. 
Diversity of the genetic stock, understood as 
genetic variation, is a prerequisite for pig 
improvement, because variation is the raw 
material on which the breeder works (Lush, 
1945). Genetic variation in pig populations is 
also needed to maintain fitness by counteract¬ 
ing inbreeding depression. Similarly, in natural 
populations, conservation of genetic variation 
is needed to allow evolution in response 
to environmental changes and to maintain 
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population fitness (Reed and Frankham, 2003). 
Conservation of both agricultural and wild 
species genetic diversity is covered by the 
Convention on Biological Diversity (CBD), now 
ratified by 186 countries, following the Rio 
Conference of 1992 (FAO, 2007). 

In this chapter we shall update the corre¬ 
sponding chapter by Hammond and Leitch in 
the 1998 (first) edition of The Genetics of the 
Pig. In the first section, what is known of the 
status of the world pig genetic resources and 
how this knowledge has been extended in the 
past decades will be reviewed. We shall then 
present the current methodologies for charac¬ 
terizing genetic resources by the use of porcine 
genetic markers, a field of research that has 

©CAB International 2011. The Genetics of the Pig, 
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undergone considerable development since 
the 1980s. In the last section, potential links 
between molecular and quantitative trait diver¬ 
sity will be discussed, as well as information 
derived from molecular studies of diverse 
genetic stocks for elucidating the genetic basis 
of production and fitness traits, a field presently 
in considerable progress. 


Status of the World Pig Genetic 

Resources 

Inventories 

A proper characterization of the available pig 
genetic resources is obviously needed for their 
management. Several levels of characterization 
exist. One of these can be achieved by a com¬ 
prehensive inventory of the available breeds 
and populations. Pig genetic resources include 
a wide spectrum of populations - or breeds - 
and various methodologies have been pro¬ 
posed to classify breeds of farm animals. The 
first attempt was probably in Mason’s World 
Dictionary of Livestock Breeds, Types and 
Varieties (1951). Mason’s approach was later 
developed and clarified by Lauvergne (1982), 
who proposed classifying breeds into four cat¬ 
egories on a domestication evolutionary scale, 
namely: (i) wild or feral populations; (ii) tradi¬ 
tional or indigenous populations, usually not 
uniform and lacking any breeders’ organiza¬ 
tion; (iii) standardized breeds, usually managed 
within breeding organizations; and (iv) selected 


lines, generally part of crossbreeding systems. 
A milestone in the inventory of farm animal 
populations has been Mason’s Dictionary , 
mentioned above, which has remained a classic 
with many further editions until 1996, and a 
new (5th) edition revised by Porter that was 
published in 2002. In the 1988 edition, which 
presents a compilation of over 500 pig popula¬ 
tions (Mason, 1988), one can find representa¬ 
tives of the above four categories, with a marked 
predominance of categories (ii) and (iii). 

Extensive country-driven inventories of 
farm animal populations have subsequently 
been launched, first in Europe in the early 
1980s under the aegis of the European 
Association for Animal Production (see Simon 
and Buchenauer, 1993, for details). Later, after 
the Rio Convention of 1992, in recognition of 
the importance of domestic animal genetic 
resources, FAO was ascribed a mandate for 
their management, and initiated a special 
action programme, the details of which are 
presented in Hammond and Leitch (1998). 
Within this framework, FAO launched an 
inventory and basic description of the breeds of 
domestic species, which can now be accessed 
from the Domestic Animal Diversity Information 
System (DAD-IS) (http://dad.fao.org/). An 
impressive amount of information has thus 
been gathered all over the world during the last 
25 years, and this has recently been integrated 
into a global assessment of the world’s animal 
genetic resources (FAO, 2007). Table 13.1 
shows the distribution of the pig breeds pres¬ 
ently known (in 2009) over the regions of the 
world, and compares this with the situation in 


Table 13.1. Pig breed inventories by region of the world compared with mammalian species of farm 
animals. Sources: Mason (1988); FAO/UNEP (1995); Domestic Animal Diversity Information System 
(DAD-IS at http://dad.fao.org/, accessed October 2009). 


Region of 
the world 3 

Africa 

America 


Asia and 
the 

Pacific 

Europe 

Near 

East 

Total 

pigs 

Mammalian 

domesticated 

species b 

Latin America 
and the Caribbean 

North 

America 

1988 situation 

7 

56 


152 

160 

0 

375 

3327 

1995 situation 

13 

24 

28 

157 

129 

2 

353 

2442 

2009 situation 

143 

241 

64 

422 

495 

12 

1377 

7835 

Increase (fold) 

20.4 

5.4 


2.8 

3.1 

nd 

3.7 

2.4 


a FAO definition. 

b Cattle + horses + goats + pigs + sheep, 
nd, no data. 
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1995 and 1988. It reveals a nearly fourfold 
increase in global coverage over the last 21 
years: 1377 breeds in 2009 as against 375 in 
Mason’s dictionary (Mason, 1988). 

This enormous increase in the number of 
‘breeds’ needs explanation. The first reason 
for the list enlargement is a growing aware¬ 
ness all over the world of the usefulness of 
identifying the amount of genetic diversity 
available in each country, and consequently 
an improved reporting under the incentives 
provided by FAO’s Global Programme for the 
Management of Farm Animal Genetic 
Resources (Hammond and Leitch, 1998). The 
second reason lies in the redefinition and 
broadening of the concept of genetic resource. 
Interest in animal genetic resources arose in 
the 1960s from an awareness of the potential 
dangers created by a continuous decrease in 
the number of breeds commercially exploited, 
and emphasis was therefore initially put on 
the conservation of endangered breeds. Then 
a distinction was made more clearly between 
conservation per se and general livestock 
resource management (see Barker, 1986). It 
is now recognized that, as mentioned above, 
the concept of a genetic resource should 
include all identified commercial breeds or 
lines, whatever their size or importance, as 
well as more specific categories of animals, 
such as wild populations or highly selected 
lines developed for research purposes. A dis¬ 
tinction is also now being made at the FAO 
level between local (or ‘regional transbound¬ 
ary’) breeds that occur only in one country (or 
region) and ‘international transboundary’ 
breeds that occur in more than one region. In 
pigs, there are relatively few regional trans¬ 
boundary breeds, and the simpler distinction 
between local and international breeds may 
therefore be retained for all practical pur¬ 
poses. As can be seen by comparing Tables 
13.1 and 13.3, less than half of the breeds 
reported in DAD-IS are local breeds. A total 
of 33 international pig breeds have been 
reported, five of which are widely predomi¬ 
nant: Duroc, Hampshire, Landrace, Large 
White (Yorkshire) and Pietrain. Several of 
these entities may themselves be divided into 
different country varieties that can be consid¬ 
ered as truly different breeds, such as, for 
example, Belgian and Danish Landrace. For 


more details on breeds of pigs see Buchanan 
and Stalder (Chapter 18). 


Use of breed information 

An important use of breed information is for 
assessing the degree to which each breed is 
exposed to the risk of becoming extinct, i.e. its 
degree of endangerment (DE). Assessing DE 
properly, however, is a difficult task because it 
requires integrating genetic and demographic 
factors which interact in various ways to deter¬ 
mine the survival prospect of any breed; this 
was stressed by Gandini et al. (2004), who dis¬ 
cussed various systems used for classifying 
breeds based on their DEs. 

The most used genetic criterion of DE is 
effective population size (N e ), which was defined 
by Wright (1931) as N e = 4 MF/(M + F), which 
combines the number of breeding males (M) 
and females (F) and allows prediction of the rate 
of increase (AF) in inbreeding as AF = l/2N e . 
A critical effective size may then be defined as 
the one allowing a given level of inbreeding to 
be reached within a given period of time (e.g. 
50 years), a system proposed by Simon and 
Buchenauer (1993), and now used by the 
European Farm Animal Biodiversity Information 
System (EFABIS) (http://efabis.tzv.fal.de/). In 
the pig, critical effective sizes for inbreeding 
values of F = 0.25 or 0.40 within 50 years are 
87 and 49, respectively (Ollivier, 1998). It 

should, however, be noted that inbreeding is 
expected to increase faster than is assumed 
from effective size. Two main reasons explain 
this: the first is that selection by itself increases 
average kinship among breeding animals and 
tends to accelerate inbreeding, as first noted by 
Robertson (1961), and the second is that effect¬ 
ive size derived from the number of breeding 
animals may be overestimated when genera¬ 
tions overlap. The overestimation depends on 
age at first offspring and survival rate (or culling 
policy) of the breeding animals. In the pig, 
assuming geometric age distributions for boars 
and sows, the overestimation may range from 
14 to 72% (see Ollivier and James, 2004, 
Table 1 and references therein). 

Breed extinction may otherwise be modelled 
in demographic terms. Demography classically 
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considers population growth and its variance as 
due to the combination of strictly demographic 
stochasticity (birth and survival processes), envi¬ 
ronmental stochasticity (unpredictable changes in 
the environment) and catastrophes such as 
disease outbreaks. As an example, rates of 
growth of 110 breeds of cattle over 13 years 
have been analysed by Gandini et al. (2004) and 
shown to be significantly affected by herd size 
and country. More data would be needed for pre¬ 
dicting the variance of population growth rate in 
a given breed or over a set of similar breeds. 
Continuous reporting on breed structure and 
inbreeding is encouraged in DAD-IS and may, in 
the course of time, become a useful tool for mon¬ 
itoring DE. In Table 13.2 it is shown that, together 
with a marked increase in the number of breeds 
being reported, breed-related information 
remains far from being complete. More than 
one-third of all reported pig breeds lack popula¬ 
tion data and cannot be evaluated for DE; this is 
also the average situation for other domestic 
species. Another weakness of the current moni¬ 
toring of breed erosion is that it cannot capture 
the effects of uncontrolled crossbreeding (FAO, 
2007). Table 13.2 shows a tendency for an 
increase of risk status over the last decade. This 
overall picture, however, covers widely different 
situations according to the region of the world, 
which is seen when referring to DAD-IS. 

Breed inventories also include information 
on the production characteristics of the various 
breeds, but these data are mostly confounded 
with breed environment and therefore do not 
tell much about real genetic diversity. Within a 
country, however, confounding may to some 
extent be avoided by expressing breed per¬ 
formance as a deviation from a standard breed, 
as suggested by Simon and Buchenauer (1993). 
Well-designed experiments, such as diallel 
crosses, would be too costly to implement in 


pigs because of the considerable testing facili¬ 
ties required as soon as the number of breeds 
exceeds three or four. But, when the number 
of breeds to compare is large, multi-breed 
designs would allow the testing of a large 
number of breeds using a few animals per 
breed (Taylor, 1976). This offers a suitable 
approach for testing diversity among a large 
number of rare breeds, but it has rarely been 
implemented in pigs, to our knowledge. 


Conservation 

Strategies for maintaining domestic animal 
genetic diversity depend on the degree of con¬ 
trol humans have over the available livestock 
populations. In that respect, two broad catego¬ 
ries of populations may be considered, namely 
commercial populations, in which the issue is 
to maintain internal diversity, and genetic 
stocks (or so-called gene banks), which may be 
used as stores for maintaining between-breed 
diversity. 

As mentioned above, genetic variation in 
commercial populations tends to decrease 
more or less automatically under the combined 
effects of restricted population size and selec¬ 
tion. When pedigrees are missing, effective 
size may be manipulated by changing the herd 
demography parameters, such as age at first 
breeding and culling policy. As an example, 
the ratio N e /N of effective size to herd size (N) 
is expected to increase to above one when 
male turnover is high. In fact, the balance 
between the positive effects on generation 
interval of high survival rate or late culling and 
their negative effects on genetic drift plays in 
favour of short breeding lifespan and early 
culling (Ollivier and James, 2004). When 


Table 13.2. Risk status of pig breeds. Sources: 1995 data: Hammond and Leitch (1998); 2006 data: from 
Fig. 12 in FAO (2007). 

Breed status 1995 situation 2006 situation 


Total number of breeds on file (breeds 353 599 

extinct excluded) 

Number of breeds with population data 265 (75.0%) 374 (62.4%) 

and percentage of total on file (in parentheses) 

Number of breeds at risk and percentage 69 (26.0%) 133 (35.6%) 

of breeds with population data (in parentheses) 
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pedigrees are available, effective size may be 
easily monitored through the calculation of 
individual inbreeding coefficients. Calculations 
may, however, become critical for very large 
populations, as discussed by Colleau (2002), 
who has proposed an indirect method that is 
able to considerably reduce computation time. 
Various software is available for pedigree 
analysis, such as (among others), pedig, from 
Boichard (2002) - now updated to pedig2007 
(Boichard, 2007). Pedigree analyses may 
reveal unexpected bottlenecks and subsequent 
episodes of rapid increase of inbreeding, even 
in quite large populations of pigs of the inter¬ 
national breed category (Maignel and Labroue, 
2001). Pedigree information allows the control 
of effective size very efficiently through mating 
schemes intended to minimize inbreeding/ 
genetic drift (see Fernandez et a/., 2003, and 
Sanchez etal., 2003, for a variety of schemes). 
Robertson (1960) has argued that such 
schemes, while aiming at increasing effective 
size for a given population size, generally lead 
to a reduction of the genetic variance exposed 
to selection, and then cannot increase the limit 
eventually reached by selection. In the short/ 
medium term, however, a joint management of 
inbreeding and selection advance is recom¬ 
mended (Kinghorn et a/., 2009). Compromises 
between selection advance and genetic drift 
may be needed in pig breeding schemes, but it 
should also be kept in mind that optimal mat¬ 
ing schemes may considerably reduce inbreed¬ 
ing without any detrimental effect on selection 
response, as shown by Colleau and Tribout 
(2008) in a retrospective study referring to 
matings actually performed. The theory of 
genetic contributions by Woolliams and 
Thompson (1994) can be used to predict the 
rate of inbreeding in populations undergoing 
selection. Using this theory, Bijma et al. (2001) 
showed how inbreeding can be predicted in 
specific livestock improvement schemes. 

Conservation of animal genetic resources, 
understood as maintenance of between-breed 
diversity, may be achieved either by live ani¬ 
mals (in vivo conservation) or by cryogenic 
storage (in vitro conservation). Let us also note 
here that in vivo conservation itself may be 
achieved in situ, i.e. under commercial farm 
conditions, or ex situ. This distinction, how¬ 
ever, often remains unclear. Live animal gene 


banks offer good opportunities for controlling 
genetic drift/inbreeding, particularly when a 
strict genetic management can be guaranteed 
by institutions such as farm parks, museums, 
zoos, national parks, universities or research 
organizations. Several examples of (long-term) 
conservation of farm animal stocks exist for 
various species (see review in FAO, 2007, 
p. 461), of which examples are given for sheep 
by the French (Rambouillet) Merino flock 
(Prod’Homme and Lauvergne, 1993) and for 
pigs by the Spanish Large White herd at 
Pontevedra, Spain (Solano, 1984). Although 
those projects differ in timespan and genetic 
management, they share the common feature 
that, in spite of the high levels of inbreeding 
generally reached, fitness traits show no major 
deterioration. This indicates that natural selec¬ 
tion is able to prevent inbreeding depression 
when the rate of increase of inbreeding is 
moderate, probably by purging deleterious 
recessive genes. On the other hand, the contri¬ 
bution of mutations to quantitative trait varia¬ 
tion may guarantee that a reasonable level of 
genetic variation be maintained over the long 
term (Hill, 1982). Such a theoretical prediction 
would particularly apply to traits not currently 
selected for, such as future breeding goals, 
which are by definition unpredictable and 
consequently under no direct selection pres¬ 
sure. The maintenance of fitness in artificial 
selection programmes has been recently 
reviewed by Hill and Zhang (2009). 

Cryogenic storage of animal genetic 
resources (in vitro conservation) implies vari¬ 
ous harvesting, processing, monitoring and 
maintenance procedures applied to biological 
materials such as animal cells, semen, oocytes 
or embryos (for details of the techniques used 
in pigs, see the review of Pizzi et al., 2001, 
and also Ross and Prather, Chapter 11). 
Storing semen and embryos is probably the 
most secure way of preserving genetic stocks, 
and it may also be the cheapest. The costs of 
different methods of conservation of farm ani¬ 
mals were evaluated for the first time by Smith 
(1984), who stressed the contrast between a 
high initial investment for establishing cryo¬ 
genic banks, particularly when embryos are to 
be collected, and the low annual storage costs. 
Consequently, cryogenic storage becomes 
cheaper than live animal conservation in the 
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course of time. More recent papers on the 
design of cryopreservation in farm animals 
have been reviewed by Labroue et al. (2001), 
who gave particular attention to the pig, a spe¬ 
cies in which cryogenic storage must essentially 
rely on frozen semen, because embryo conser¬ 
vation is not yet fully operational, in spite of 
recent progress in ultra-rapid cooling tech¬ 
niques. Security constraints have to be empha¬ 
sized, implying that semen should be stored in 
at least two locations, in order to be able to 
replace the stock accidentally lost in one loca¬ 
tion by using the quantities stored in the other. 
In addition, when only semen is stored, the 
renewal of a breed that is becoming extinct 
should be made possible in the absence of living 
females of the same breed. This requires grad¬ 
ing up any available female population using 
the stored semen through a backcrossing sys¬ 
tem over a sufficient number of generations. 

Labroue et al. (2001) have proposed the 
following general expression for the number of 
doses (D) of semen to be stored per pig breed 
in each location, as a function of d, the number 
of doses needed per insemination, N, the 
number of fertile gilts aimed at generation n of 
the repeat backcross scheme, and r, the 
expected number of inseminations needed to 
obtain a fertile daughter, i.e. the inverse of the 
number of fertile daughters per insemination: 
D = dN(r n - 1 )/(r - 1). The application of this 
formula to various situations has been dis¬ 
cussed by Labroue et al. (2001). Assuming 
d = 2, i.e. double-dose artificial insemination 
(AI), N = 25, a conservative value of 1.5 for r, 
and n = 6, it can be seen that over 2000 doses 
are needed to accommodate two locations. 
Labroue et al. (2001) also evaluated the cost of 


a breed collection under the above assump¬ 
tions, and assuming 25 boars collected, as rec¬ 
ommended by Smith (1984). On the basis of 
2000 doses (80 doses from each of the 25 
boars), their evaluation of €30,000 in 2001 
would presently amount to about €35,000 per 
breed. Annual storage costs represent about 
1% of this amount. 

Table 13.3 summarizes in vivo and in 
vitro conservation activities in various regions 
of the world, with the latter technique playing 
a predominant role. Apart from Asia and 
Europe, which harbour many local breeds and 
where about a quarter of these are being pre¬ 
served, a large variation can be seen - from a 
nearly complete preservation in North America 
(though of a small number of local breeds), to a 
contrastingly much lower coverage in Latin 
America and the Caribbean, and no reported 
in vitro conservation in Africa. 


The Use of Genetic Markers for 
Characterizing Genetic Diversity 

Although the number of breeds known to exist 
in a country, or in a region of the world, is a 
valid indicator of some diversity, it remains of 
limited value when comparisons are being made 
between different countries or regions, because 
the data presented very much reflect the infor¬ 
mation provided by each country. In spite of 
the efforts made to coordinate inventories 
across different countries or regions through 
DAD-IS, the information displayed should not 
be considered as necessarily representing a 
comprehensive and comparable coverage of 


Table 13.3. Pig conservation activities by region of the world. Adapted from FAO, 2007. 


Region of the world 

Africa 

America 

Latin America and 
the Caribbean 

North 

America 

Asia and 
the Pacific 

Europe 

Near 

East 

Total 

Number of local/ 

51 

70 

19 

243 

182 

1 

566 

regional breeds 
Number of breeds 

6 

2 

0 

60 

28 

0 

96 

conserved in vivo 
Number of breeds 

0 

7 

18 

92 

47 

0 

164 

conserved in vitro 
% conserved in vitro 

0 

10 

94.7 

37.9 

25.8 

0 

27.2 
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all situations. In addition, as said before, infor¬ 
mation on the production characteristics of the 
various breeds is mostly confounded with breed 
environment. In contrast, genetic markers offer 
the great advantage of an objective assessment 
of genetic diversity, which is by definition inde¬ 
pendent of environmental effects. Molecular 
markers, known to be highly polymorphic, 
ubiquitous over the genome and liable to auto¬ 
mated identification, are the tools of choice in 
the evaluation of genetic variability. The most 
used DNA marker technologies are simple 
sequence repeats (so-called microsatellites) and 
(arbitrary) amplification fragment length poly¬ 
morphism (AFLF). Microsatellites (MS) and 
AFLF are both numerous and dispersed over 
the pig genome, making them both suitable for 
biodiversity analyses. An important question, 
however, is whether marker variability well 
reflects variability at quantitative trait loci (QTLs) - 
for instance variability of production or fitness- 
adaptation traits. This aspect will be discussed in 
the next section of this chapter, including the 
prospects opened by a new class of markers 
called single nucleotide polymorphisms (SNPs). 


Genetic diversity between breeds 

One classical method of measuring genetic 
diversity is the calculation of fixation indices (F) 
as proposed by Wright in 1943 (see Nei, 1977). 
The F st in particular measures the degree of 
genetic differentiation among a set of popula¬ 
tions based on analyses of variance of allele 
frequencies (Weir and Cockerham, 1984). A dif¬ 
ferent approach is that of Nei (1973), who has 
proposed a concept of gene diversity based on 
the actual heterozygosities observed at several 
loci, and shown that total gene diversity can be 
partitioned into intra-populational and inter- 
populational gene diversities. Wright’s F ST and 
Nei’s G st both measure genetic differentiation 
between groups of individuals. A method for 
measuring genetic diversity based on Nei’s 
diversity parameters has been proposed by Petit 
et a/. (1998). 

Another method, advocated by an econo¬ 
mist (Weitzman, 1992), is based on the princi¬ 
ple of ‘monotonicity in species’, according to 
which the addition of any population i (species, 


breed, ...) to a set of populations (S) should 
increase its diversity (V) by an amount at least 
equal to the distance d(i, S) between breed i 
and the original set S, as represented by the 
expression: V(Sui) > V(S) + d(i, S). Here, d(i, 
S ) is the so-called ‘Hausdorff’ distance, defined 
as the distance between i and its closest neigh¬ 
bour in S. In fact, a distance in the mathemati¬ 
cal sense is not necessary; a dissimilarity 
measure will do, and in genetics we can take a 
classical distance such as the Nei-standard, 
Nei-DA, Cavalli-Sforza, Reynolds or Sanghvi 
(Foulley and Hill, 1999; Ollivier et a/., 2005). 
This principle led Weitzman to define the diver¬ 
sity of the set S as the maximum, over all mem¬ 
bers of the set, of the distance of a member 
from its closest neighbour in the set augmented 
by the diversity of the set without that member. 
Taking S\i to define the set without member i, 
V(S) = max jeS [d(i, S\i) + V(S\/)] measures the 
diversity of S. The maximization procedure is 
detailed in Weitzman (1992), and illustrated on 
a set of cattle breeds by Thaon d’Arnoldi et al. 
(1998). The method provides a way of measur¬ 
ing the contribution of any individual breed to 
the diversity of a given set of breeds, because 
the V function can obviously be applied to any 
subset of S. The contribution of any given 
breed i to between-breed diversity (CB) may be 
expressed as CB i = 1 - l/(S\i)/l/(S). This can 
also be viewed as an estimate of the breed’s 
genetic uniqueness. In addition, the method 
generates a rooted tree, which may be inter¬ 
preted as a taxonomic tree (see below), on 
which diversity can be ‘read’, as each branch 
length represents the contribution of the cor¬ 
responding breed to total breed diversity (see 
examples in the following subsection on 
taxonomy and clustering). A software imple¬ 
mentation of the Weitzman method, down to 
tree drawing, is available (Derban et a/., 2002, 
updated in 2005). 

As an illustration of the measurement of 
genetic diversity, a partitioning of MS diversity 
among 22 breeds of pigs is presented in Fig. 
13.1; the analysis is more completely discussed 
in Ollivier and Foulley (2009). The data are 
drawn from a European investigation described 
by Ollivier (2009). In this subsample, as in the 
complete study, which involved 68 European 
domestic breeds (see Fig. 13.2), more than half 
of the breed diversity could be assigned to the 11 
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Relative contribution 

Fig. 13.1. Partition of diversity among 22 breeds of pigs. Breed contributions relative to total between-breed 
diversity (3.6) were computed over 50 microsatellite loci, using the diversity function of Weitzman (1992), 
applied to pairwise Reynolds genetic distances (Reynolds etal., 1983). Breed codes are: MS - Meishan 
(China); PR - Presticke (Czech Republic); CR, LI, Y2 - Creole, Landrace, Yorkshire Large White (France); 
AS, BB, H2, L2, P1,Y1 - Angler Sattelschwein, Bunte Benheimer, Hampshire, Landrace, Pietrain, Yorkshire 
Large White (Germany); CS, D1, NS - Cinta Senese, Duroc, Nera Siciliana (Italy); PU - Pulawska (Poland); 
Bl - Bisaro (Portugal); RE - Retinto (Spain); D2, GO, HI, P2, TA- Duroc, Gloucester Old Spot, Hampshire, 
Pietrain, Tamworth (UK). Adapted from Ollivier and Foulley (2009). 


60 



CB (MS) CB (AFLP) CL/1/(MS) CW(AFLP) 


Fig. 13.2. Partition of European pig diversity among three categories of breeds based on microsatellite 
(MS) and amplified fragment length polymorphism (AFLP) molecular markers, as described in Ollivier etal. 
(2005). The three breed categories were: Loc, local; Int, international; Com, commercial line. CB, category 
contribution (%) to between-breed diversity, applying the method described for Fig. 13.1. CW, category 
contribution (%) to within-breed diversity, based on expected heterozygosities. MS, data from 50 
microsatellite loci on 68 European domestic breeds (total between-breed diversity 11.6 on the Reynolds 
distance scale, average expected heterozygosity 0.56). AFLP, data from 58 European domestic breeds 
(total between-breed diversity 4.5 on the Reynolds distance scale, average expected heterozygosity 0.11). 
Reproduced from Ollivier (2009) with the kind permission of Cambridge University Press. 
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local breeds. The method has already been 
applied to several farm animal species (see refer¬ 
ences in Ollivier and Foulley, 2009), and, as in 
this pig example, large contributions of ‘native’ 
breeds have been found in several other species. 

Other methods for analysing diversity 
have been proposed, essentially based on the 
concept of co-ancestry (Caballero and Toro, 
2002; Eding et a/., 2002). Those methods 
offer the additional possibility of adjusting the 
contribution of each breed to a ‘core set’ in 
order to maximize diversity. This refinement 
may be of interest to in vitro conservation with 
limited budgets. An application of the method 
of Caballero and Toro (2002) to Iberian pig 
diversity has been made by Fabuel et ai (2004). 
The method has been shown to give in prac¬ 
tice results very similar to the Weitzman method 
in terms of between-breed diversity, though it 
is liable to yield some negative breed contribu¬ 
tions (Ollivier and Foulley, 2005). 


Genetic diversity within breeds 

The need to incorporate within-breed variability 
into diversity assessments has often been 
emphasized in plant as well as animal genetic 
resources analyses (e.g. Petit et a/., 1998; 
Barker, 2001; Caballero and Toro, 2002). By 
the way, this may be seen as an equivalent for 
farm animals of the ‘principle of complementa¬ 
rity’ that is invoked in species conservation, 
whereby species are valued not only according 
to their dissimilarity from other species, but also 
according to their intraspecific diversity (Bonin 
et a/., 2007). The contribution of breed i to 
within-breed variability may be defined, in a way 
similar to CB, as CM/ = 1 - H(S\i)/H(S), where 
H(S) is the mean heterozygosity of the set S, and 
H(S\i) = iX/*i^(/)l/(n-l) is the mean hetero¬ 
zygosity of the set after excluding breed i, 
assuming n breeds in S, where / represents each 
breed of set S different from i. As 
XJLi H(S \ i) =li =1 H(i) = nH(S ), one can see 
that the contributions to within-breed diversity add 
up to zero over breeds. Negative CWs are thus to 
be expected for highly homozygous breeds, as 
their extinction raises the average heterozygos¬ 
ity of the remaining breeds. An example of within- 
breed diversity breakdown is given in Fig. 13.2. 


The emphasis on within-breed relative to 
between-breed variability may vary according 
to the conservation objective being pursued. 
Flexibility can be obtained by a global diversity 
index that gives appropriate weights to the CB 
and CW of the breeds (Ollivier and Foulley, 
2005). This general Weitzman-type approach 
thus offers a way to capture the maximum 
diversity in situations as diverse as, for instance, 
the design of synthetic lines for purposes of 
selection in farm animals such as the pig, or in 
the creation of experimental lines of laboratory 
animals for medical research. 


Allelic richness and allelic diversity 

Allelic richness, defined as the number of alleles 
per locus, is a diversity measure of obvious 
interest. The observed allelic richness, how¬ 
ever, needs correction for sample size, as the 
chances of discovering a new allele increase 
each time a new individual is observed. Several 
methods have been proposed for taking sam¬ 
ple size into account. Basically, allelic richness 
can be estimated either by rarefaction (El 
Mousadik and Petit, 1996) or by extrapolation 
(Foulley and Ollivier, 2006). 

In the first case (rarefaction), allelic rich¬ 
ness (r) is defined as the number of alleles 
expected to be seen in a sample of specified 
size g, which is the smallest sample of all breeds 
examined at a given locus. Each allele k, 
observed N ik times in a sample of size N, from 
population i, has a probability (P ik ) of not being 
seen in a sample of size g equal to 
P,[ 3) = C(N i - N ik ,g) / C(N n g) , where C(N,g) 
represents the number of combinations of N 
objects taken g at a time, so that allele k has a 
probability of 1 - P ik 3) of being seen in a sam¬ 
ple of size g. Then, the allelic richness of popu¬ 
lation i at any given locus is obtained as 

r, = Z?=i (1 - P ik {9) ) = K,~ Iti P ik 3 \ by add- 

ing the K l alleles observed in the population. 

In the second case (extrapolation), the 
idea is to estimate allelic richness (R) by adding 
to the number of alleles actually observed (K) 
in the population sample the number expected 
to be missing owing to sampling. Thus, 
R i = + 1km(1 - Kkm ) N c, where n v n 2 , ... n k 

are the overall allelic frequencies in the whole 
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sample of populations considered, and km 
means k e A (rn) , the set of alleles actually miss¬ 
ing in the i sample (Foulley and Ollivier, 2006). 
Notice the contrast between the two formulae. 
In the rarefaction situation, we subtract from 
the number of alleles actually observed (K) the 
expected number of alleles not seen in a sam¬ 
ple of smaller size g, whereas in the extrapola¬ 
tion case, we add to K, the expected number of 
alleles missing owing to the sampling of a finite 
specimen N f of the population. 

The concept of allelic richness leads to the 
slightly different concept of allelic diversity, which 
refers to the existence of alleles specific to some 
breeds. One has indeed to admit that a high 
number of alleles in a breed does not automati¬ 
cally guarantee their originality. A parallel to the 
measure of diversity may be drawn with the 
approach previously described for genetic diver¬ 
sity. Based on the total number of alleles observed 
in any set of breeds, which is the diversity func¬ 
tion of interest, the contribution of breed i to 
allelic diversity, equivalent to the CB j defined for 
genetic diversity, is the number of alleles present 
in population i and absent in all others, also 
called ‘private alleles’ (Ollivier and Foulley, 
2009). The parallel between genetic and allelic 
diversity cannot, however, be pursued further, 
because allelic richness is a property attached to 
each population, and there cannot be any varia¬ 
tion in within-breed allelic diversity. 

Here again, correction for sample size is 
needed to take into account sample size and the 
number of copies of each private allele. This can 
be done either through rarefaction or extrapola¬ 
tion, as explained in detail by Ollivier and Foulley 
(2009), who compare the two methods on the 
same sample of breeds. The diversity functions 
based on rarefaction and extrapolation will usu¬ 
ally give similar results, at least in terms of breed 
contributions to diversity. Extrapolation, how¬ 
ever, is recommended when sample sizes are, 
on average, small or highly unbalanced among 
populations (Foulley and Ollivier, 2006). 


Distinctiveness and conservation 

potential 

More generality may be given to the diversity 
functions considered above by taking into 


account the risks of extinction of each breed. 
Risks of extinction can be evaluated as indicated 
previously, using various criteria, examples of 
which can be found for African cattle in Reist- 
Marti et al. (2003) or for European pigs in 
Ollivier et al. (2005). Based on the survival 
probabilities, P, of each of n breeds over a 
given period of time, 2" extinction-survival pat¬ 
terns may occur with given probabilities and, 
assuming those events to be independent, an 
expected diversity can be defined, possibly dis¬ 
counted to present (Weitzman, 1993): E(V) = 'L 
P(Q)V(Q), where V(Q) is the diversity of subset Q 
(as previously defined for the diversity of set S, 
V(S)), and P(Q) is its probability. 

The distinctiveness (D) of breed i is 
obtained as the partial derivative of expected 
diversity with respect to P,. D, then represents 
the increase in expected diversity within a given 
period of time with respect to the increase in 
survival probability of breed i (Weitzman, 1993). 
For instance, in the case of three breeds, the 
genetic distinctiveness of one breed (e.g. Df ) is 

D\ = P 2 (1 - P 3 ) V 12 + (1 - P 2 ) P 3 V 13 + P 2 P 3 (V 123 - 

V 23 ), where P, is the probability of survival of 
breed i, and V 12 is the diversity of the corre¬ 
sponding subset 12, V 13 the diversity for subset 
13, etc. A parallel expression holds for allelic 
distinctiveness, defined as =P 2 (1 -P 3 )d (1,2) + 
(1 -P 2 ) P 3 d (1 - 3) +P 2 P 3 d (1,23), where d(/,Q) is 
the allelic distance (or dissimilarity) between 
breed i and subset Q (here either breed 2, 
breed 3 or subset 23). This distance is defined 
as the number of alleles present in breed i and 
absent in subset Q, which is also obviously a 
‘Haussdorf’ distance because it is equal to the 
number of alleles present in breed i and absent 
in its closest neighbour in Q (see Equation 23 in 
Weitzman, 1998). An example of comparative 
genetic and allelic distinctiveness of 11 endan¬ 
gered breeds of pigs is given in Ollivier and 
Foulley (2009), where a wide variation in breed 
distinctiveness is observed, ranging from zero 
to 24%. The product D i E i , where £, = 1 - P„ 
has been named the conservation potential by 
Weitzman (1993). This can be shown to be the 
optimal ranking criterion for establishing cryo- 
preservation priorities under a budget constraint, 
in the particular situation (likely to apply to pigs) 
when cryopreservation can make a breed safe 
and cryogenic preservation costs are equal over 
different breeds (Weitzman, 1998). 
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Taxonomy and clustering 

In addition to analyses of genetic diversity, 
genetic distances can also be used to draw trees, 
which are often termed phylogenetic trees. The 
term implicitly refers to evolutionary theory, 
where diversity arises from speciation, i.e. the 
division of one ancestor species into two differ¬ 
ent subspecies. Such a pattern of evolution can¬ 
not generally apply to farm animal breeds, 
except in particular short-term situations when 
one breed happens to be subdivided into two 
new ones. Domestic breeds do not remain as 
distinct as species do and the tree-like branching 
process in species evolution does not quite apply 
to farm animal breeds. The trees drawn must be 
considered as telling the evolutionary story that 
best fits the diversity observed, but not as neces¬ 
sarily telling the ‘true’ story (Weitzman, 1992). 
The trees drawn are best viewed as classification 
tools, showing taxonomies rather than true phy- 
logenies. Quite complex migration-admixture 
patterns usually prevail, as well as occasional 
unreported mixing of breeds. The resulting trees 
therefore often disagree with documented ‘old’ 
breed history and supposed breed proximities 
(e.g. in Porter, 1993), as discussed for European 
local breeds by Ollivier (2009). 

An example of taxonomy is given in Fig. 
13.3. This is the rooted tree generated by 
applying the Weitzman diversity function to the 
subset of breeds considered in Fig. 13.1. The 
longest branch is that of the Chinese Meishan, 
as expected for such an ‘exotic’ breed in the 
context of this European investigation. The 
graph also shows the non-additivity of individ¬ 
ual breed contributions, as the joint contribu¬ 
tion of Gloucester Old Spot and Tamworth 
on the left of the graph, represented by the 
ordinate of their node (below 0.4), is much less 
than the sum of their individual branch lengths 
(above 0.6), the reverse being seen for the two 
Duroc strains located just beside them. 

Trees also show clustering phenomena, as 
exemplified in Fig. 13.3 for the two varieties of 
each of the five international breeds, Duroc, 
Hampshire, Landrace, Large White (Yorkshire) 
and Pietrain, a pattern that is confirmed 
when all varieties of each breed are considered. 
In contrast, no clustering of local breeds with 
international breeds appears, thus suggesting 
that the uniqueness of European local breeds 


reflects their having evolved apart from main¬ 
stream international breeds. 


Assignment of individuals to breeds 
and population structure 

Genetic markers can also be used to assign indi¬ 
viduals of unknown origin to populations (e.g. 
breeds or lines). Two cases have to be 
distinguished here, usually reported in the 
literature as ‘supervised’ and ‘non-supervised’. 
In the first case, there is a reference set of pop¬ 
ulations out of which samples of individuals 
have been typed for genetic markers. The ques¬ 
tion then is to know which one of these popula¬ 
tions an individual of unknown origin could be 
assigned to. In the second case, the structure of 
the populations (numbers and characteristics of 
each of them) is itself unknown and the prob¬ 
lem is to draw inferences about that structure. 

By using the estimated allele frequencies 
in each breed at several unlinked loci, one can 
compute the likelihood that a given genotype 
belongs to a given breed, and then assign each 
individual to the breed showing the largest like¬ 
lihood (Paetkau et a/., 1995). In a study of 11 
breeds of pigs using 18 microsatellites, it was 
thus possible to correctly assign all animals 
(Laval et a/., 2000). A potential advantage of 
AFLP over microsatellites is the possibility of 
selecting loci yielding ‘private fragments’ for 
assignment analyses. This has been shown to 
be feasible by Alves et al. (2002), who were 
able to discriminate Duroc from Iberian pigs, 
owing to 14 private Duroc fragments out of 
139 fragments examined. Bayesian procedures 
can also be applied; these are especially well 
fitted to take into account missing data at some 
loci and uncertainty in the knowledge of gene 
frequencies (Baudouin et al., 2004). 

When no objective information is available 
to define the set of populations examined, 
model-based clustering methods, as opposed to 
the standard distance-based methods presented 
above, can be used. Such clustering methods 
have been proposed by Pritchard et al. (2000) 
and Dawson and Belkhir (2001), and they have 
been widely applied to natural populations. 
These methods have also gained some popular¬ 
ity in studies of breeds of farm animals, though 
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Fig. 13.3. Weitzman taxonomic tree. Rooted tree generated by the diversity function of Weitzman (1992), 
using the matrix of genetic distances from Fig. 13.1. Breed codes: MS - Meishan (China); PR - Presticke 
(Czech Republic); CR, LI, Y2 - Creole, Landrace, Yorkshire-Large White (France); AS, BB, H2, L2, PI, Y1 - 
Angler Sattelschwein, Bunte Benheimer, Hampshire, Landrace, Pietrain, Yorkshire Large White (Germany); 
CS, D1, NS - Cinta Senese, Duroc, Nera Siciliana (Italy); PU - Pulawska (Poland); Bl - Bisaro (Portugal); 
RE - Retinto (Spain); D2, GO, HI, P2, TA - Duroc, Gloucester Old Spot, Hampshire, Pietrain, Tamworth 
(UK). Adapted from Ollivier and Foulley (2009). 


in this case the populations are usually well 
defined and the usefulness of the method may 
be questioned. The method may, however, help 
to confirm the degree of genetic robustness of 
the actual breed structure and reveal possible 
admixtures. In fact, supervised methods of 
assignment may gain efficiency if some classifi¬ 
cation information among reference popula¬ 
tions is also taken into account in the statistical 
procedure, as it is more difficult to assign indi¬ 
viduals among close populations (e.g. among 
Large White lines) than among distant ones 
(e.g. Large White versus Meishan), as shown by 
Benezech (2008, unpublished results). 

Comparisons between types of markers 

Both microsatellites and AFLP have been 
extensively used in the pig, but rarely on the 


same set of breeds. When this was done, and 
meaningful comparisons could be made, it 
appeared that overall genetic diversity in AFLP 
was considerably below that in MS, both 
within-breed (0.12 versus 0.56 for expected 
heterozygosity) and between-breed (0.11 ver¬ 
sus 0.23 for F st ) (Foulley et a/., 2006). In spite 
of these differences in total diversity, the indi¬ 
vidual breed contributions to both diversities 
were positively correlated between the two 
markers (see the similarity between the MS and 
AFLP graphs in Fig. 13.2). The correlations 
(r = 0.5), however, were moderate and some¬ 
what lower than would be expected if the 
evolution of the two markers had been gov¬ 
erned mainly by genetic drift. This suggests 
that MS and AFLP may carry different diversity 
information (Foulley et a/., 2006), in accord¬ 
ance with what has been reported in plants 
(see the review by Nybom, 2004). 
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Relationships Between Molecular 
and Quantitative/Adaptive Trait 
Divergence Revealed by Genetic 
Resources Investigations 

Tests of marker neutrality 

Studies of DNA markers offer objective meas¬ 
ures of diversity which can help decision mak¬ 
ers to identify the most unique breeds, which 
may then be prioritized for conservation, as 
we have seen above. They also provide infor¬ 
mation on the history of domestication (e.g. 
Bruford et a/., 2003). The relevance of 
molecular markers in studies of diversity, how¬ 
ever, has been questioned on the grounds that 
neutral loci are being used, and these may not 
reflect differences between populations at the 
loci under selection. We know, however, that 
neutral genes can be affected by the selection 
applied to neighbouring genes, a phenome¬ 
non known as gene hitchhiking (Maynard 
Smith and Haig, 1974). Selection acts on the 
whole genome, and diversity should be viewed 
in the context of multi-locus systems. This 
situation is, in fact, exploited in reverse in 
marker-assisted selection procedures using 
markers close to QTLs. Quite extensive QTL 
maps are now available for several farm ani¬ 
mals, including the pig (Hu and Reecy, 2007). 
Most of the microsatellites used in pig diver¬ 
sity studies are, indeed, linked to a large 
number of quantitative traits. This is the case 
in particular for the ISAG (International 
Society for Animal Genetics)/FAO panel of 
microsatellites listed in Laval et a/. (2000), as 
shown in Table 13.4, established from the infor¬ 
mation available in the pig QTL database 
described by Hu et a/. (2005). One would 
then a priori expect to find some correlation 
between marker and quantitative trait diver¬ 
sity, particularly for the markers most closely 
linked to QTLs. The testing of marker neutral¬ 
ity, however, is a challenging task. The tests 
are based on genome scans of DNA polymorph¬ 
isms because the adaptation of domestic 
breeds to local conditions or to specific pro¬ 
duction objectives is expected to generate 
changes in within-breed and/or between- 
breed diversity at those loci underlying the 
traits under selection, and at nearby neutral 


marker loci. This is a subject also extensively 
investigated in natural populations (as 
reviewed, among others, by Storz, 2005). 

The neutrality tests based on relative lev¬ 
els of diversity within populations exploit the 
reduction of variability around a selected locus 
owing to hitchhiking, a phenomenon called 
‘selective sweep’. A test based on the variance 
of repeat number at microsatellite loci has 
been proposed by Schlotterer et a\. (1997). 
Low variances for particular locus x popula¬ 
tion combinations reveal directional selection, 
whereas high variances indicate within-breed 
balanced polymorphisms. This test, however, 
is sensitive to the model assumed, and a more 
robust test, which compares two groups of 
populations for a large number of microsatel¬ 
lite loci, has been proposed by Schlotterer 
(2002). The test statistic is the log of the ratio 
of variance in repeat number. An application 
of this test to a set of European pig breeds 
typed for 50 microsatellite markers showed 
the existence of ‘outlier’ loci with contrasting 
allele size distributions between groups of 
breeds, indicative of selective sweeps (Ollivier 
and Foulley, 2009). 

Differentiation between populations as 
measured by F ST is the basis of an early test of 
selective neutrality (Lewontin and Krakauer, 
1973). The basic argument behind the 
Lewontin-Krakauer (LK) test is that, under the 
null hypothesis of neutrality, differentiation at all 
loci should be the same. The observed variance 
of F st across marker loci can thus be tested 
against its expected value under the assumption 
of neutrality. Several improvements of the LK 
test have been proposed (reviewed by Ollivier 
and Foulley, 2009), which are essentially for 
taking into account the pattern of relationships 
among populations. After correction for this 
effect, the LK test showed a particularly large 
departure from neutrality for AFLF in a set of 
European pig breeds (Foulley et a/., 2006). 


Perspectives opened by single 
nucleotide polymorphisms (SNPs) 

The lack of a sufficient number of genetic 
markers is recognized as a major limitation in 
the task of detecting marker-trait associations, 
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Table 13.4. Number of traits reported in the literature as being linked to the 27 ISAG (International 
Society for Animal Genetics)/FAO Advisory Group recommended microsatellites. Data on QTL flanking 
markers from the Pig Quantitative Trait Locus (QTL) database (Pig QTLdb) at http://www.animalgenome. 
org/cgi-bin/QTLdb/SS/index (as of December 2009). 



Chromosome 

Number of 



Trait class 3 



Marker 

arm 

traits 

Meat quality 

Production 

Health 

Exterior 

Reproduction 

CGA 

ip 

11 

8 

2 

— 

1 

— 

SOI 55 

iq 

9 

5 

1 

1 

1 

1 

SW240 

2p 

27 

20 

1 

4 

1 

1 

S0226 

2 q 

2 

2 

— 

— 

— 

— 

SW72 

3p 

40 

24 

7 

5 

1 

3 

S0002 

3q 

1 

1 

— 

— 

— 

— 

S0227 

4p 

14 

8 

4 

1 

— 

1 

S0005 

5q 

22 

17 

4 

1 

— 

— 

IGF1 

5q 

7 

5 

1 

— 

— 

1 

SW122 

6q 

27 

26 

— 

— 

1 

— 

S0228 

6q 

19 

19 

— 

— 

— 

— 

SW632 

7q 

8 

5 

— 

— 

3 

— 

SOI 01 

7q 

19 

11 

4 

1 

2 

1 

S0225 

8q 

9 

3 

— 

6 

— 

— 

SOI 78 

8q 

4 

2 

— 

— 

— 

2 

SW911 

9p 

20 

15 

1 

— 

3 

1 

SW951 

10q 

9 

4 

3 

— 

— 

2 

S0386 

i iq 

2 

1 

— 

— 

— 

1 

S0090 

12q 

21 

13 

1 

2 

4 

1 

S0068 

13q 

24 

20 

4 

— 

— 

— 

S0215 

13q 

6 

3 

2 

1 

— 

— 

SW857 

14q 

12 

11 

1 

— 

— 

— 

S0355 

15q 

6 

3 

1 

— 

2 

— 

SW936 

15q 

27 

20 

3 

1 

1 

2 

S0026 

16q 

21 

20 

1 

— 

— 

— 

SW24 

17q 

— 

— 

— 

— 

— 

— 

S0218 

Xq 

— 

— 

— 

— 

— 

— 

Total number of trait-linked markers 
per trait class 

25 

17 

10 

11 

12 


a As defined on the above-mentioned QTL database web site; ‘meat quality’ includes carcass traits, ‘production’ mainly 
covers growth traits, ‘exterior’ includes behaviour. 


in spite of the considerable numbers of loci 
from MS and AFLP already available. In 
recent years, new sequencing technologies 
have appeared and allowed the design of 
high-density genotyping assays. Large num¬ 
bers of single nucleotide polymorphisms 
(SNPs) have thus been identified in several 
species of domestic animals (references in 
Ramos et a/., 2009). This kind of genetic 
marker is especially appealing, as SNPs are 
biallelic, very common (one per 1000 bp) and 
easily assayed. SNPs have also been recently 
identified in the pig by Ramos et al. (2009). 


Using a ‘PorcineSNP60 Beadchip’, they were 
able to reliably score 62,621 loci, of which 
58,994 were polymorphic in the 158 pigs 
they used. The availability of such a tool now 
makes it possible to more efficiently detect 
outlier loci by using a model-free approach 
along the line of the LK test previously 
described, as in the human example of Akey 
et al. (2002). Other methods for distinguish¬ 
ing loci under selection from neutral loci 
require assumptions on the demographic his¬ 
tory of the populations and may be sensitive 
to the model implemented. Model-based 
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approaches using Bayesian hierarchical 
methods have recently been developed and 
applied to SNP data (Beaumont and Balding, 
2004; Riebler et a/., 2008; Gautier et a/., 
2009). Approximate Bayesian Computation 
(ABC) also allows the implementation of 
more sophisticated models of molecular 
evolution (Pritchard et a/., 2000; Beaumont 
et al., 2002). SNPs thus open a new field of 
investigation in pig genetic diversity, already 
exemplified in cattle by the Bayesian scan of 
Gautier et al. (2009). 

SNPs will also help to evaluate the extent 
of linkage disequilibrium (LD) in the various 
breeds of pigs. LD is a non-random associa¬ 
tion of genes at different loci, and is known to 
decrease rapidly with increasing map distance 
of the loci considered. Detecting significant 
LD, therefore, needs narrowly spaced genetic 
markers. One of the earliest studies of LD in 
pigs actually used 15 MS, spaced 5cM on 
average, and was able to show significant LDs 
on two pig chromosomes (Nsengimana et al., 
2004). The availability of SNPs now opens 
the way to more detailed investigations, as 
shown by a recent comparison by Amaral 
et al. (2008) bearing on ten European breeds, 
ten Chinese breeds and a European wild boar. 
This study of 371 SNPs revealed more 
extended LD in Europe than in China, with 
the wild boar in an intermediate position. 
Interestingly, this study also showed a very 
highly significant breed x genome region 
interaction for LD, which might reflect differ¬ 
ential selection pressures across genomic 
regions among breeds. LD might then be an 
additional component of genetic diversity 
worth considering. 


Conclusions 

Interest in farm animal genetic resources, includ¬ 
ing the pig, emerged in the early 1960s, but, as 
noted by Barker (1986), for the following quar¬ 
ter of a century, there was ‘a great deal of talk 
and relatively little action’. The situation has 
since undoubtedly changed. As we have seen in 
this chapter, substantial advances have been 
made in the inventories of the pig genetic 
resources, in their management and conserva¬ 
tion, and in the characterization of their genetic 
diversity. The need to properly evaluate diversity 
and to put its management into an economic 
context, integrating diversity and vulnerability, can¬ 
not be overemphasized. International cooperation 
is essential, as is funding, as has been recently 
exemplified by the enormous task of assessing 
the status and trends of animal genetic resources 
worldwide, based on 169 country reports (FAO, 
2007). More research is certainly needed for 
objectively assessing fitness and adaptation traits 
among populations (van der Werf et al., 2009), 
and on the controversial issues around molecular 
and quantitative trait diversity. Molecular charac¬ 
terization of pig genetic resources should be 
encouraged, both as a tool for managing the 
resources and to throw more light on the genetic 
basis of adaptive divergence, so as to increase our 
knowledge of the pig genome to better preserve 
its diversity. Help will be provided in that respect 
by new molecular technologies, as exemplified 
by the PorcineSNP60 Beadchip recently devel¬ 
oped (Ramos et al., 2009), as well as by new 
computing methods based on more complex 
models of population genetics (e.g. Beaumont 
et al., 2002) that are applicable to farm animal 
populations. 
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Introduction 

Performance of an animal destined for market 
in a pork production system is defined by the 
efficiency with which it develops saleable prod¬ 
uct. That efficiency is largely determined by 
costs associated with feed and time, and by the 
amount of quality lean tissue produced. Genetic 
merit for market animal performance is 
improved in seed-stock populations by per¬ 
formance testing and selection. For successful 
commercial pork production, a breeding sys¬ 
tem must be implemented that optimizes mar¬ 
ket pig performance and reproductive 
efficiency. Thus, the genetics of performance 
encompasses not only additive and non-additive 
genetic effects associated with feed intake and 

©CAB International 2011. The Genetics of the Pig, 
2nd Edn (eds M.F. Rothschild and A. Ruvinsky) 


tissue growth, but also strategies for imple¬ 
menting performance testing and selection, 
and consideration of the genetic correlations 
between market pig performance and traits of 
reproduction. 

Performance Testing and Selection 

Strategies 

Classical index selection 

A classical approach to multi-trait selection 
involves the derivation of a linear index of pheno¬ 
typic measurements with weightings that maxi¬ 
mize its correlation with a selection objective 
(Hazel, 1943). The selection objective is defined 
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as a linear combination of breeding values for 
traits considered of economic importance, and 
phenotypic measurements are chosen as criteria 
with which to most effectively estimate genetic 
merit for the selection objective. The traits that 
define the selection objective may or may not be 
among the phenotypic measurements chosen, 
depending on the difficulty of measurement and 
the availability of correlated indicator traits. 
Estimates of genetic and phenotypic variation 
and covariation associated with the measured 
traits and those in the objective, and the relative 
economic values of traits in the objective, are 
used to derive the optimum combination or index 
of the phenotypic selection criteria. Thus, this 
classical selection index is sometimes also referred 
to as an economic index of genetic merit. 

Because the overriding objective of a pork 
enterprise is the efficient production of quality 
lean meat, typical selection indices applied in 
the industry have reflected this purpose by relat¬ 
ing to a breeding objective that includes genetic 
merit for leanness, growth rate and feed effi¬ 
ciency. Phenotypic measurements have tradi¬ 
tionally included post-weaning growth (days of 
age at an ideal market weight or rate of gain 
from around 25 kg to market weight) and live- 
animal backfat thickness at market weight. In 
recent decades, advancements in ultrasound 
technology have facilitated routine inclusion of 
loin muscle dimensions as well as backfat thick¬ 
ness of the live animal at the end of the testing 
period. Less frequently, phenotypic records 
may include individual feed intake and efficiency 
during the post-weaning period and carcass 
information from relatives. As a result, the cor¬ 
responding phenotypic index usually includes 
an ultrasonic measurement of backfat depth 
and loin muscle dimensions at market weight 
and average daily gain for the test period, but 
often does not include direct measurements of 
feed intake and efficiency. 

Hence, while one of the advantages of the 
classical index is the ability to select on an 
objective trait through correlated phenotypic 
measurements, the method requires reasona¬ 
bly accurate estimates of genetic correlations 
among all traits and of relative economic val¬ 
ues. Also, an underlying assumption is often 
made that the trait measured during perform¬ 
ance testing in the nucleus population (e.g. 
average daily gain) is perfectly correlated with 


the same trait in the commercial system being 
targeted (i.e. it has the same genetic control in 
both populations). 

Interactions with environments 

The general structure of a livestock industry, 
with respect to the creation and dissemination 
of genetic improvement, can be described as a 
pyramid with nucleus, multiplier and commer¬ 
cial levels. Although selection may be prac¬ 
tised at all levels, it is selection at the nucleus 
that determines the rate of permanent genetic 
improvement in the industry. Thus, the selec¬ 
tion objectives addressed in nucleus herds must 
accurately reflect production goals at the com¬ 
mercial level. 

Testing methods in nucleus populations are 
designed to provide unbiased estimates of 
genetic potential, and generally result in rela¬ 
tively uniform performance and greater herita- 
bility. But the testing environment under which 
candidates for selection are evaluated in nucleus 
herds is often different from the commercial 
production environment. For example, extraor¬ 
dinary effort is made through rigorous biosecur¬ 
ity methods to prevent the exposure of nucleus 
animals to swine pathogens. Boars tested in 
nucleus populations are typically penned in 
small groups or even individually, and may 
receive feed ad libitum , or some form of 
restricted or semi-restricted feeding, such as 
scheduled feedings to appetite. Commercial 
market pigs, on the other hand, are usually 
penned in larger groups and are at greater risk 
of exposure to pathogens, and, while most com¬ 
mercial producers in the USA allow pigs free 
access to feed, restricted feeding in the latter 
part of the post-weaning period is more com¬ 
mon in European commercial units. 
Consequently, the benefits of greater heritability 
from reduced phenotypic variance in nucleus 
testing may be offset by genetic correlations 
between performance in the nucleus testing 
environment and in the commercial production 
setting of less than unity (Brascamp et a/., 
1985; Mulder and Bijma, 2005), resulting 
in less genetic progress. In a summary of litera¬ 
ture results (Merks and de Vries, 2002), genetic 
correlations between environments, or between 
purebreds and crossbreds, were close to unity 
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for some traits but significantly less for others 
(e.g. for backfat, 0.5 to 0.8). Lutaaya et al. 
(2001) estimated genetic correlations between 
each of two purebred lines (A and B) with their 
reciprocal crosses (C). Estimates of genetic cor¬ 
relations were different in the two crosses for 
both lifetime gain (A-C: 0.99; B-C: 0.62) and 
backfat (A-C: 0.32; B-C: 0.70). Two strains of 
commercial Durocs (PI and P2) were used to 
estimate genetic correlations for performance 
traits in nucleus pigs and their crossbred prog¬ 
eny (Cl and C2, respectively) (Zumbach et a/., 
2007). Estimates for weight at last measure¬ 
ment, weight per day of age, backfat and loin 

depth were 0.53 ± 0.08, 0.60 ± 0.07, 0.83 ± 
0.09 and 0.78 ± 0.05, respectively, based on 
Pl-Cl, and 0.80 ± 0.10, 0.79 ± 0.07, 0.89 ± 
0.05 and 0.80 ± 0.08, respectively, based on 
P2-C2. Although differences in the method and 
time of measurement in nucleus versus cross¬ 
bred animals may have affected some of the 
genetic correlation estimates in these studies, 
the genetic correlation for some traits in some 
populations is likely to be different enough from 
unity to warrant consideration of breeding meth¬ 
ods (such as the inclusion of commercial data in 
nucleus selection schemes) to account for geno¬ 
type x environmental interactions. 

Variation in testing environments can also 
change the effective selection objective for a 
given set of measurements. For example, single¬ 


trait selection for rate of gain among animals 
that have ad libitum access to feed puts empha¬ 
sis on appetite and may result in increased fat¬ 
ness (Woltmann et al., 1992, 1995). However, 
selection solely for gain among animals limited 
to a standard amount of feed may emphasize 
lean gain by identifying those animals that parti¬ 
tion the allotted feed to the relatively efficient 
process of lean deposition (Webster, 1977). 
Several models of metabolizable energy utiliza¬ 
tion and resulting tissue growth have been 
described for the pig (e.g. Whittemore and 
Fawcett, 1976; Whittemore, 1986; Moughan 
et al., 1987). Performance testing environ¬ 
ments and selection schemes based on these 
models have been proposed as alternatives with 
which to address selection objectives associated 
with post-weaning performance. 

Models of growth and alternative 

strategies 

Whittemore (1986) described a relatively sim¬ 
ple model based on a deductive approach in 
which the objective is to determine the causal 
forces that result in animal tissue growth. The 
model is best summarized in terms of daily rates 
of lean and fat tissue gain in response to daily 
feed intake (Fig. 14.1). The animal’s genetic 
potential for maximum rate of lean tissue 



Feed supply (kg/day) 


Fig. 14.1. A hypothesis for the 
relationship between daily gains 
of lean and fat and the daily feed 
supply. From Whittemore (1986, 
p. 619) with permission from 
Journal of Animal Science. 
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deposition is relatively constant beginning at an 
early age, and over nearly the entire range of 
ages and weights associated with the post- 
weaning production period. Therefore, as daily 
feed intake is increased, rate of lean gain 
increases linearly until it plateaus when the ani¬ 
mal’s maximum potential is reached. The area 
to the left of the point at which lean gain pla¬ 
teaus may be thought of as nutritionally limited, 
and the area to the right, in which maximum 
lean gain is realized, as nutritionally unlimited. 

During the nutritionally limited phase of 
growth, it is assumed that the animal will parti¬ 
tion most of the available energy to lean growth, 
while maintaining some physiologically normal, 
but minimal, level of fat gain. In the nutritionally 
unlimited phase of growth, most of the feed 
consumed beyond what is needed for maximum 
lean gain is partitioned to fat deposition. Thus, 
when feed intake is allowed to increase into this 
range the animal begins to deposit fat rapidly, 
and grows less efficiently and eventually more 
slowly owing to the relatively greater energy 
requirements of fat deposition. 

The principles described in this model are 
the basis for post-weaning production environ¬ 
ments in which ad libitum feed intake is 
restricted, a practice common in European sys¬ 
tems. The objective of restricting access to feed 
by pigs in finishing facilities is to limit daily feed 
intake to the minimum level at which maxi¬ 
mum lean gain is realized, thereby avoiding 
unnecessary and costly fat gains. In some pop¬ 
ulations, feed intake in the latter stages of the 
post-weaning period can be restricted to as 
much as 75% of ad libitum without inhibiting 
lean growth potential (Fowler et a/., 1976). 
Restricting post-weaning feed intake is some¬ 
times referred to as scale feeding because sys¬ 
tems are typically scaled so that each pig’s daily 
allotment of feed is increased as the post- 
weaning period progresses according to either 
time or body weight. 

As depicted in graphs a and b of Fig. 14.1, 
a given amount of daily feed intake can result in 
nutritionally unlimited growth in one animal, 
but in nutritionally limited growth of another, 
depending on their relative genetic merit for 
maximum lean gain. Differences in lean gain 
potential between sexes of pigs and between 
some genetic strains of pigs have been well 
quantified (Campbell and Taverner, 1988; 


Eissen, 2000). Maximum lean gain potential is 
also assumed to vary among individuals of the 
same sex within a breed or genetic strain, and 
thereby becomes a potential component of the 
breeding programme objectives. Consequently, 
the principles outlined here become useful in 
the development of hypotheses regarding 
expected response to selection for perform¬ 
ance traits, and the design of testing and selec¬ 
tion schemes to improve genetic potential for 
lean gain and the efficiency of lean gain. 

Fowler et at. (1976) attempted to model 
interactions between selection (genetic) object¬ 
ives, nutritional environments during perform¬ 
ance testing and nutrition provided during 
commercial production. The model of growth 
that they considered (Fig. 14.2) was similar to 
that of Whittemore (1986). Metabolizable energy 
intake not lost as heat, i.e. energy retained as 

(a) (b) 



(c) (d) 



Fig. 14.2. Schematic models showing different 
ways in which improvement of lean tissue feed 
conversion (LTFC) may occur. PE, product energy 
and LT, lean tissue: (a) simultaneous increase in 
lean tissue growth rate (LTGR) and decrease in 
mean rate of feed intake; (b) decrease in mean 
rate of feed intake; (c) increased LTGR with rate of 
feed intake unconstrained; (d) increased LTGR 
with rate of intake kept constant. From Fowler etal. 
(1976, p. 379) with permission from the British 
Society of Animal Science. 
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product (PE), is partitioned either into skeletal 
muscle and essential accessory tissues, categor¬ 
ized together as lean tissue (LT), or into fat tissue. 
In an environment in which the pig has ad libi¬ 
tum access to feed, energy will be partitioned to 
the deposition of lean tissue until the maximum 
potential for lean growth rate is met, after which 
energy will be deposited as fat. The nutritional 
environments considered, both for performance 
testing and for the commercial production sys¬ 
tem, were scale feeding and ad libitum access 
to feed. In the context of performance testing, 
the time- or weight-based feeding scale would 
be designed to limit intake to a proportion of 
ad libitum, such that variation in appetite is 
not expressed but restraint of lean growth is 
minimized. Discussion of ad libitum access 
also applies, to some extent, to systems in 
which animals are allowed to eat freely for a set 
period of time twice each day, i.e. semi-ad 
libitum. 

As an alternative to the classical selection 
index objective, expressed as a linear function 
of breeding values for traits deemed to have 
economic importance, Fowler et al. (1976) 
suggested a more biological definition of the 
objective based on physiological factors related 
to the market value of the pig. Because the 
primary product of the industry is lean pork 
and the largest costs to the system are those 
associated with feed and time, the two ‘biologi¬ 
cal’ objectives suggested were lean tissue 
growth rate (LTGR) and the feed required per 
unit of lean produced or lean tissue feed con¬ 
version (LTFC). 

The simplest of these biological indices 
would use either LTGR or LTFC as the selec¬ 
tion objective and as the selection criterion. In 
practice, selection for the objective LTGR 
would be based on the difference between esti¬ 
mates of lean content at the onset of the test 
and at the end of the test, in each case based 
on a function of live weight and ultrasonic 
measurements of subcutaneous fat depth and 
loin muscle area or depth. In a testing environ¬ 
ment in which candidates are given free access 
to feed, selection for the objective LTFC would 
require the direct measurement or estimation of 
individual feed intake. When performance test¬ 
ing is conducted with a time-scale feeding sys¬ 
tem, in which pigs start the test at a standard 
weight, are tested for a standard length of time 


and are given an amount of feed based on time 
of test, variation in feed intake and in days of 
test is zero by design. Consequently, the bio¬ 
logical indices LTGR and LTFC are perfectly 
correlated under this system. 

Three primary testing/selection scenarios 
were considered by Fowler et at. (1976): Case 1, 
testing environment - ad libitum, objective - 
LTFC; Case 2, testing environment - scale feed¬ 
ing, objective - LTFC (LTGR); and Case 3, testing 
environment - ad libitum, objective - LTGR. 

Case 1: Response to selection for LTFC is 
expected to be through a reduction in genetic 
potential for feed intake (appetite) and an 
increase in lean tissue gain (Fig. 14.2a). The 
relative emphasis on decreased feed intake will 
be greater when full expression of appetite is 
allowed, and the emphasis may shift almost 
entirely towards reduced feed intake if the esti¬ 
mate of lean gain is poor, or if a greater amount 
of consideration is given to leanness rather 
than lean gain (Fig. 14.2b). Decreased appetite 
may presently be desirable in some industry 
populations in which lean gain is not limited by 
intake. However, as lean gain potential nears 
the limit presented by ad libitum energy 
intake, selection emphasis would need to be 
reversed to improve LTFC (Fig. 14.2c). 

Case 2: In this case, the testing environment of 
scale feeding is aimed at removing variation in 
feed intake without limiting the expression of 
lean gain potential. Improvement of LTFC is 
entirely through selection emphasis on LTGR, 
with genetic potential for feed intake uncon¬ 
strained (Fig. 14.2c). As lean gain potential is 
increased, additional improvements in LTGR and 
LTFC under this scenario will eventually require 
increased emphasis on appetite. However, unlike 
Case 1, a reversal in the selection pressure on 
feed intake would not be necessary. 

Case 3: This is typical of many on-farm test¬ 
ing situations in which individual feed intake is 
not measured. Feed intake is expected to 
remain unchanged under this scenario 
(Fig. 14.2d) until lean gain potential becomes 
limited, after which improvements in LTGR 
must be accompanied by increased feed intake 
(Fig. 14.2c). This method avoids the cost of 
measuring individual feed intake, but only 
addresses one component of LTFC. 
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Based on the principles of the model, the 
authors also determined expectations for the 
performance of animals resulting from the test¬ 
ing/selection scenarios when used in commer¬ 
cial systems that provided either ad libitum or 
restricted access to feed. They concluded that 
pigs from Case 1 would be most likely to pro¬ 
duce acceptable carcasses in systems allowing 
ad libitum feed intake. Figs from Cases 2 and 
3 would be expected to perform better in sys¬ 
tems with restricted intake than with ad libitum 
intake, and in a lean-based marketing system. 

Another alternative strategy for improving 
the efficiency of lean growth is more directly 
aimed at the reduction of residual feed intake, 
i.e. the amount by which actual ad libitum feed 
intake differs from that expected to be required 
for maintenance and maximum deposition of 
lean tissue. To obtain an estimate of residual feed 
intake in pigs, a statistical model that includes 
covariate adjustments for body weight and gain 
as well as ultrasonic measurements of body com¬ 
position is typically applied to ad libitum feed 
intake records (e.g. Cai et a/., 2008). When 
these methods are effectively implemented, the 
resulting estimates of residual feed intake are 
phenotypically independent of the performance 
traits, although underlying genetic covariances 
may exist. Optimal weighting of genetic merit 
for feed intake may also be achieved by defining 
a selection objective that includes as traits the 
parameters from the linear plateau model for 
feed intake and protein deposition maxima, as 
well as for the marginal ratio of fat and protein 
deposition (de Vries and Kanis, 1992). In this 
approach, feed intake can have either positive or 
negative value in the selection objective depend¬ 
ing on the relative positions in the population for 
feed intake and protein deposition capacities. 
Application of this approach has been hindered 
by the difficulty in identifying practical traits for 
the index and adequate estimates of genetic 
covariances between those measured traits and 
the traits describing the linear plateau model 
(Hermesch et a/., 2003). 

Additive (Co)variation and Selection 

Response 

As discussed in the previous section, understand¬ 
ing the underlying genetics of post-weaning 


performance in pork production must include 
knowledge of the genetic control of important 
biological components and associated strategies 
for genetic improvement through performance 
testing and selection. The literature provides esti¬ 
mates of the amount of genetic variance and 
covariance available for selection, reports from 
controlled selection experiments and a growing 
amount of detail around the biology underlying 
genetic variation in performance traits. 

Parameter estimates 

There have been many reported estimates of 
heritabilities (h 2 ) and genetic correlations (r g ) 
associated with post-weaning performance 
traits based on covariation among relatives. A 
summary of estimates from several of these 
studies is presented in Tables 14.1 and 14.2 
for environments of ad libitum or semi-ad libi¬ 
tum feed intake and restricted feed intake, 
respectively. The estimates are listed in the 
order of the citations in each case, along with 
the range and a simple average. It is important 
to mention that, in addition to differences in 
the way feed was provided, there were also dif¬ 
ferences in the breeds studied and some varia¬ 
tion in the methods implemented, such as the 
testing interval used and location of backfat 
measurements. For some traits (h 2 ) or trait 
combinations (r), most of the estimates were 
fairly similar across experiments. But, for those 
cases in which there were conflicting results, 
possible interactions with experimental differ¬ 
ences or fixed effects within experiments will 
be discussed. 

In Table 14.1 is a summary of estimates 
from animals either allowed ad libitum access 
to feed or fed by hand to appetite (semi-ad libi¬ 
tum). Heritability estimates for average daily 
gain (ADG) and backfat thickness (BF) under 
those conditions were mostly moderate and 
clearly indicate that these traits would be 
expected to respond to selection. Estimates for 
backfat thickness tended to be higher than 
those for ADG. Heritability estimates for daily 
feed intake (FI), residual feed intake (RFI) and 
feed conversion rate (feed/gain, or FCR) were 
also generally moderate and suggest that selec¬ 
tion would be successful. There were fewer 
estimates for LTGR and LTFC reported, each 



Table 14 . 1 . Estimates of heritabilities and genetic correlations for pigs with ad libitum or semi-ac/ libitum access to feed. 


References 3 


Heritabilities 

1,2, 3, 4, 5, 6, 7, 8, 9, 10, 11, 
12, 13, 14, 15, 16, 17, 18, 19 


1,2, 3, 6, 8, 9, 10, 11, 12, 13, 
14, 15, 16, 19, 20, 21,22, 23 


1,2, 3, 6, 7, 8, 10, 11, 12, 15, 
17, 18, 24, 25 

I, 2, 3, 6, 7, 8, 10, 11, 12, 14, 
15, 17, 18 

15, 17, 18, 19 

II, 24, 25, 22 
11,24, 25 

Genetic correlations 

1,2, 3, 6, 7, 8, 11, 12, 15, 26 

1,2, 3, 6, 11, 12, 15 

1,2, 3, 6, 7, 8, 11, 12, 18, 26 


1,2, 3, 6, 11, 12, 15, 26 

1,2, 3, 6, 9, 11, 12, 13, 15, 26 


Trait(s) b Estimates 0 


Range 


Average 


ADG 

0.41°; 0.41°; 0.20°; 0.40; 0.07; 0.36; 0.41; 

0.41,0.03; 0.28; 0.49; 0.43; 0.37, 0.14; 

0.17, 0.33; 0.39; 0.24; 0.19; 0.23, 0.25; 

0.44; 0.42 

0.03-0.49 

0.29 

BF 

0.66°; 0.74°; 0.56°; 0.47; 0.60, 0.12; 0.34; 0. 

57; 0.59; 0.46, 0.25; 0.56, 0.50; 0.43; 

0.36; 0.45; 0.68; 0.56; 0.38; 0.48, 

0.48, 0.49, 0.48; 0.48, 0.44 

0.12-0.74 

0.49 

FI 

0.34°; 0.41°; 0.13°; 0.62; 0.23; 0.17, 0.26; 

0.16; 0.45; 0.29,0.19; 0.23; 0.14, 0.24; 

0.53; 0.29; 0.18 

0.13-0.62 

0.29 

FCR 

0.58°; 0.48°; 0.23°; 0.27; 0.18; 0.41,0.12; 

0.51; 0.28; 0.19, 0.15; 0.20; 0.16; 0.35, 

0.37; 0.27 

0.12-0.58 

0.30 

RFI 

0.17, 0.11,0.15, 0.10; 0.14, 0.24; 0.34, 

0.41,0.42; 0.29 

0.10-0.42 

0.24 

LTGR 

0.39; 0.44, 0.44, 0.46, 0.39; 0.38; 0.25 

0.25-0.46 

0.39 

LTFC 

0.34; 0.35; 0.25 

0.25-0.35 

0.31 

ADG/FI 

0.32°; 0.50°; 0.37°; 0.83; 0.89; 0.69; 0.73; 

0.80; 0.76, 0.41; 0.82; 0.89 

0.32-0.89 

0.67 

BF/FI 

0.38°; 0.08°; 0.35°; 0.59; 0.42; 0.51; 

0.24; 0.64 

0.08-0.64 

0.40 

ADG/FCR 

-0.57°; -0.71°; -0.86°; 0.34; -0.05; 

-0.69; -1.24; -0.28; -0.52; -0.52; 

-0.10; 0.78 

-1.24-0.34 

-0.37 

BF/FCR 

0.10°; 0.44° 0.24; 0.28, 0.36; 0.40; 0.33 

0.10-0.44 

0.31 

ADG/BF 

0.02°; -0.26°; -0.06°; 0.55; 0.35; 0.32; 

0.26, -0.25; 0.09, -0.12; 0.37; 0.37 

-0.26-0.55 

0.14 

Continued 


Continued 





Table 14.1. Continued. 


References 3 

Trait(s) b 

Estimates 0 

Range 

Average 

15, 17, 18, 19 

RFI/ADG 

0.11,0.17, 0.12, 0.18; -0.35, 0.00; 0.28, 

-0.35-0.28 

0.10 



0.23, 0.15; 0.06 



15 

RFI/BF 

0.67, 0.22, 0.67, 0.20 

0.20-0.67 

0.44 

17, 18, 19 

RFI/FCR 

0.70, 0.71; 0.67, 0.86, 0.84; (-0.69 FE) 

0.67-0.86 

0.76 (0.75) 

11 

ADG/LTGR 

0.96 



11 

ADG/LTFC 

-0.09 



11 

BF/LTGR 

0.02 



11 

BF/LTFC 

0.52 



24, 25 

FI/LTGR 

0.23; 0.31 


0.27 

24, 25 

FI/LTFC 

-0.45; -0.36 


-0.41 

24, 25 

LTGR/LTFC d 

0.76; 0.87 


0.82 


References: (1) Smith et al. (1962); (2) Smith and Ross (1965); (3) Standal and Vangen (1985); (4) Nordskog et al. (1944); (5) Fahmy and Bernard (1970); (6) McPhee et al. (1979); 

(7) Wyllie et al. (1979); (8) Cameron et al. (1988), estimates presented for Large White and Landrace populations, estimates for backfat (BF) are an average of those for shoulder, 
mid-back and loin measurements; (9) McPhee et al. (1988); (10) de Haer and deVries (1993); (11) Mrode and Kennedy (1993); (12) Cameron and Curran (1994a), estimates presented 
for Large White and Landrace populations; (13) Hetzer and Miller (1972a), estimates presented for Duroc and Yorkshire populations; (14) Cameron and Curran (1995a), restriction 75% 
of ad libitum, all estimates presented are averaged across selection lines in Large White population, estimates presented for BF are an average of those for shoulder, mid-back and loin 
measurement sites; (15) Johnson et al. (1999), estimates presented are based on alternative models for estimation of residual feed intake (RFI) - (i) initial test age and weight and test 
average daily gain (ADG), (ii) model (i) plus BF, (iii) model (i) plus loin eye area, (iv) model (i) plus BF and loin eye area; (16) van Wijk etal. (2005); (17) Gilbert et al. (2007), estimates 
presented are for boar candidates and for castrated males and females; (18) Hoque et al. (2007), estimates presented are from alternative models for estimation of RFI predicted by 
nutritional requirements, or by phenotypic or genetic regressions on metabolic body weight and ADG; (19) Cai etal. (2008); (20) Gray et al. (1968); (21) Berruecos et al. (1970); 

(22) Chen etal. (2003); (23) Newcom et al. (2005), estimates presented are for carcass and ultrasound BF; (24) Cameron (1994); (25) Cameron and Curran (1994b); (26) Robison and 
Berruecos (1973). 

b Traits: ADG, average daily gain; BF, backfat thickness; FI, daily feed intake; FCR, feed conversion ratio or feed/gain; RFI, residual feed intake; LTGR, lean tissue growth rate; LTFC, lean 
tissue feed conversion. 

c Semi -ad libitum fed (e.g. hand-fed to appetite). 

d LTFC based on an index of BF and FCR; increasing values of LTFC correspond to greater lean efficiency (lower FCR). 
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based on an ultrasonic prediction of lean con¬ 
tent of the animal at the completion of the test, 
but the reports were similar and indicate that 
these traits are also moderately heritable. 

Estimates of the genetic correlations of 
ADG and backfat with FI were all positive, and 
most were moderate to high. Estimates tended 
to be higher with ad libitum than with semi-ad 
libitum consumption, probably reflecting less 
than full expression of appetite in the latter set¬ 
ting. Although, based on these estimates, FI 
would be expected to increase with greater 
genetic merit for ADG, the genetic correlation 
between ADG and feed conversion was favour¬ 
able in most of the reports cited. The exceptions 
(McPhee et a/., 1979 (0.34); Wyllie et a/., 
1979 (-0.05)) were from studies in which pigs 
had continuous access to feed. Thus, the major¬ 
ity of reports suggest that the greater FI accom¬ 
panying increased ADG would be more than 
offset by fewer days on feed. Estimates of the 
genetic correlation between backfat thickness 
and feed conversion revealed that selection for 
less backfat should also improve efficiency. 

Estimates of the genetic correlation 
between RFI and either feed conversion or effi¬ 
ciency indicate that selection for less RFI can 
improve the efficiency of weight gain. The esti¬ 
mated correlations of RFI with other produc¬ 
tion traits suggest that RFI may also result in 
changes in ADG and backfat thickness, but 
that correlated response may also depend on 
the model used to estimate RFI (e.g. Johnson 

et a/., 1999). 

Reports of the correlation between ADG 
and backfat thickness were quite variable, rang¬ 
ing from moderate and favourable (-0.26) to 
moderate and unfavourable (0.55). Differences 
among experiments in the methods and tech¬ 
nicians used for backfat measurement, and the 
typically large sampling errors associated with 
estimates of genetic correlations in experimen¬ 
tal populations (Roots and Gibson, 1994), 
make interpretation of the variation among 
estimates difficult. However, it appears that 
some of the variation may be due to breed dif¬ 
ferences. For example, Cameron and Curran 
(1994a) reported that the correlation was 0.26 
in Large White selection lines, but -0.23 in 
Landrace selection lines. Perhaps the genetic 
correlation between ADG and backfat in a 
population depends on how tightly coupled the 


traits are with FI versus the ability to partition 
energy intake to lean tissue growth. In the 
report by Cameron and Curran (1994b), the 
estimated genetic correlations of FI with ADG 
and BF were 0.76 and 0.39, respectively, in 
the Large White lines, and 0.41 and 0.13, 
respectively, in the Landrace lines. The extent 
to which free access to feed is available may 
determine this link between growth and back¬ 
fat with FI. The average estimate of the corre¬ 
lation between ADG and backfat was 0.22 and 
-0.10 under ad libitum and semi-ad libitum 
intake, respectively. 

Mrode and Kennedy (1993) included 
LTGR and LTFC (feed/lean), based on ultra¬ 
sonic predictions of lean content of Yorkshire, 
Landrace and Duroc boars at the completion 
of test, in their study of FI, ADG and backfat. 
They reported that genetic merit for ADG and 
LTGR were closely correlated (0.96), but that 
the correlation between ADG and LTFC was 
small (-0.09). Conversely, the genetic correla¬ 
tion between backfat and LTGR was near zero 
(0.02), but the relationship between backfat 
and LTFC was moderate and positive (0.52). 
Cameron and Curran (1994a) and Cameron 
etal. (1994) reported genetic correlations of FI 
with LTGR and LTFC. Notice that LTGR and 
LTFC were based on ultrasonic backfat meas¬ 
urements and ADG and feed/gain, respec¬ 
tively, and that decreasing values for LTFC 
corresponded with more feed per unit of lean. 
These estimates indicate that, under an envi¬ 
ronment of ad libitum feed intake, LTGR and 
LTFC are positively but not perfectly corre¬ 
lated, and selection for improved LTFC would 
be expected to result in decreased genetic 
potential for FI, both results consistent with the 
suggestions of Fowler etal. (1976) as discussed 
in the previous section. 

Estimates of genetic parameters for pigs 
restricted in their access to feed are available 
from evaluations of data from Danish test sta¬ 
tions and from investigations of selection based 
on performance testing with scale feeding 
(Table 14.2). For studies in which parameters 
from environments of ad libitum and restricted 
intake were compared directly, the comparable 
estimate from Table 14.1 is presented in 
parentheses in Table 14.2. 

As might be expected, phenotypic varia¬ 
tion in ADG tended to be less with restricted 
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Table 14 . 2 . Estimates of heritabilities and genetic correlations for pigs with restricted feed intake. 3 


References 3 

Trait(s) b 

Estimates 0 

Range 

Average 

Heritabilities 

1,2, 3, 4, 5, 

ADG 

0.24; 0.39(0.20); 0.22, 0.14; 0.76(0.41), 

0.14-0.76 

0.30 

6, 7,8 

1,2, 3, 4, 5, 

BF 

0.14(0.03); 0.41(0.28); 0.30, 0.35; 0.17, 
0.16; 0.26(0.39) 

0.47; 0.41(0.56); 0.26, 0.29; 0.06(0.60), 

0-0.60 

0.31 

6, 7,8 

2 

2, 3, 4, 8 

FI 

FCR 

0(0.12); 0.60(0.34); 0.32, 0.36; 0.29, 

0.28; 0.35(0.43) 

0.20(0.13) 

0.35(0.23); 0.23, 0.19; 0.56(0.41), 

0.16-0.56 

0.29 

7 

LTGR 

0.16(0.12); 0.24(0.20) 

0.34, 0.28 


0.31 

Genetic correlations 

2 

2 

2, 3,4 

ADG/FI 

BF/FI 

ADG/FCR 

0.28(0.37) 

0.29(0.35) 

—0.93(—0.86); -1.07, -1.02; -1.03(-0.69); 

-1.07-0.93 

-1.0 

2,3 

BF/FCR 

-0.99(-1.24) 

0.30(0.24); 0.16; 0.23 

0.16-0.30 

0.23 

2, 3, 5, 6, 7 

ADG/BF 

-0.21 (-0.06); -0.07, -0.31; -0.22(0.35); 

-0.39-0.08 

-0.16 



0.08, -0.39; -0.10;-0.02 




References: (1) Lush (1936); (2) Standal and Vangen (1985); (3) Merks (1987), estimates presented are for Landrace 
and Yorkshire boars; (4) Cameron et al. (1988), estimates presented are for Large White and Landrace populations, 
backfat (BF) estimates are an average of those for shoulder, mid-back and loin measurements; (5) McPhee et al. (1988); 
(6) Gu et al. (1989), fed to a time-based scale, male and female averages presented; (7) Cameron et al. (1994), 
restriction to 75% of ad libitum, estimates presented are for Large White and Landrace populations; (8) Cameron and 
Curran (1995a), restriction to 75% of ad libitum, all estimates presented are averaged across selection lines in Large 
White population, BF estimates are an average of those for shoulder, mid-back and loin measurements. 
b Traits: ADG, average daily gain; BF, backfat thickness; FI, daily feed intake; FCR, feed conversion ratio or feed/gain; 
LTGR, lean tissue growth rate. 

Estimates in parentheses are for (semi -)ad libitum intake, and are from Table 14.1. 


than with ad libitum intake (Cameron et a/., 
1988; McPhee et al ., 1988; Cameron and 
Curran, 1995a). Restricted feeding tended to 
reduce phenotypic variation in FCR (Cameron 
et al ., 1988) and backfat thickness (McPhee 
et a/., 1988) in some studies, but there have 
also been reports that observable variation in 
feed conversion (Cameron and Curran, 1995a) 
and backfat (Cameron et al., 1988) was unaf¬ 
fected by feed restriction. Restriction of feed 
would be expected to have the greatest effects 
on phenotypic variances in backfat and feed 
conversion in populations in which ad libitum 
energy intake greatly exceeds lean growth 
potential. The associated variation in fat depo¬ 
sition and partitioning of energy to lean versus 
fat would be reduced as feed restriction moves 
energy intake closer to lean growth potential. 

Effects of feeding environment on herit¬ 
abilities (Table 14.2) are difficult to summarize 


from literature reports. Average heritability 
estimates reported for ad libitum or semi-ad 
libitum (Table 14.1) and restricted (Table 14.2) 
feeding environments were very similar for 
ADG and feed conversion; the average esti¬ 
mate for backfat tended to be greater with ad 
libitum or semi-ad libitum feeding than with 
restricted intake (0.49 versus 0.31). Among 
the studies in which (semi-)ad libitum and 
restricted intake were directly compared, 
Standal and Vangen (1985), Cameron et al. 
(1988) and McPhee et al. (1988) reported that 
heritability of ADG was greater with restricted 
feeding, but Cameron and Curran (1995a) 
reported a tendency for the opposite effect. 
Heritability estimates for backfat reported by 
Standal and Vangen (1985), Cameron et al. 
(1988) and Cameron and Curran (1995a) sug¬ 
gested a reduction due to restricted feeding, 
but McPhee et al. (1988) reported that feed 
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restriction tended to increase heritability for 
backfat. In none of these studies was precision 
adequate to conclude that the heritabilities 
were truly different. 

Most estimates of the genetic correlation 
between ADG and backfat in pigs with restricted 
access to feed were negative (Table 14.2), con¬ 
sistent with the trend noted when comparing 
estimates of the same correlation in pigs with 
semi-ad libitum versus ad libitum access (Table 
14.1). McPhee et al. (1988) reported that the 
correlation changed from 0.35 with ad libitum 
to -0.22 with restricted intake, and Standal and 
Vangen (1985) estimated that the correlation 
was -0.06 and -0.21 for pigs with semi-ad libi¬ 
tum and restricted intake, respectively. Although 
Gu et al. (1989) reported values of 0.08 and 
-0.39 in boars and gilts, respectively, under an 
environment of scale feeding, they noted that 
boars were fed more liberally than gilts and 
attributed the difference in the estimates to the 
relatively greater restriction of intake in the gilts. 
As pointed out by Gu et al. (1989), the change 
in sign of this correlation is also consistent with 
the suggestions of Fowler et al. (1976) and 
Webster (1977). Greater merit for ADG with ad 
libitum intake is associated with greater daily 
consumption and fat deposition, but superior 
merit for ADG when intake is restricted is due to 
the ability to partition available energy to the 
relatively efficient process of lean growth. 

The genetic correlation between ADG and 
feed conversion also seems to be affected by 
restricted feeding. Estimates of the correlation 
under (semi-) unlimited intake are moderate to 
high and negative. With restricted intake, all of 
the estimates cited are near -1.0. This result is 
also consistent with genetic models of growth. 
With ad libitum intake, increased genetic merit 
for ADG is associated with greater FI, but also 
fewer days on feed, resulting in a net decrease 
in feed conversion (feed/gain). When intake is 
restricted, greater merit for ADG must be 
through greater efficiency without increased 
FI, and is thus more closely correlated with 
feed conversion. 

Results from selection experiments 

The potential to exploit the additive genetic 
variation and covariation associated with post- 


weaning performance traits of pigs has been 
studied in numerous selection experiments. As 
suggested by moderate heritability estimates 
based on covariation among relatives 
(Table 14.1), the fundamental post-weaning 
traits of body growth and fatness will respond 
to selection. In an early study (Krider et al., 
1946), divergent selection on weight for age 
was applied in a Hampshire population. After 
nine generations of selection (Baird et al., 
1952; Craig et al., 1956), average weight at 
180 days of age was 27.7kg greater in the 
high line than in the low line. Subsequently, 
there have been several additional reports of 
significant direct responses to single-trait selec¬ 
tion for weight at a given age (Kuhlers and 
Jungst, 1990, 1991a,b) or for post-weaning 
ADG (Rahnefeld, 1971; Rahnefeld and 
Garnett, 1976; Fredeen and Mikami, 1986a,b; 
Woltmann et al., 1992, 1995; Clutter et al., 
1995b). 

Selection for total body growth with free 
access to feed has been accompanied by 
changes in FI. High-line Hampshires consumed 
0.64kg/day more feed than those from the 
low line during a 7 2-day post-weaning test that 
followed the nine generations of selection 
(Baird et al., 1952), and Clutter and Buchanan 
(1998) reported a 5 genetic standard deviation 
difference in daily feed intake between lines 
after ten generations of divergent selection for 
post-weaning ADG. When the divergent lines 
in the latter study were restricted to a standard 
amount of feed intake, differences in ADG 
between the lines were not significant 
(Woltmann et al., 1992). Thus, most of the 
response in ADG was attributed to changes in 
feed intake. Direct responses to selection for 
growth rate have been great enough to offset 
the correlated changes in feed intake, resulting 
in improved total body feed efficiency from 
upward selection for growth (Rahnefeld, 1973) 
and greater efficiency in upward than in down¬ 
ward divergent lines (Baird et al., 1952; Clutter 
and Buchanan, 1998). 

Changes in body composition in response 
to single-trait selection for growth have varied 
somewhat among experiments. Divergent 
selection for ADG resulted in greater backfat 
thickness at 105 kg body weight, but also 
greater LTGR, in the fast line than in the slow 
line (Woltmann et al., 1992, 1995). Selection 
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for greater weight at 200 days of age in a 
Duroc population was accompanied by an 
increase in backfat thickness, a decrease in 
percentage of muscle and no change in LTGR 
(Kuhlers and Jungst, 1992b), but selection on 
the same criterion in a Landrace herd resulted 
in increased LTGR without significant changes 
in backfat and percentage of muscle (Kuhlers 
and Jungst, 1993). Differences in correlated 
responses may be due to the specific criterion 
applied, the selection differential achieved, 
random drift and the effects of sampling. 
However, the change in body composition 
from selection for growth may also depend on 
the genetic potential for FI relative to lean 
growth in the base populations in which selec¬ 
tion is applied or, as depicted by the estimates 
of the genetic correlations between ADG and 
backfat (BF) in Tables 14.1 and 14.2, the real 
degree of access to feed. In general, it appears 
that direct response to selection for rate of 
total body growth with complete access to feed 
results largely through increased FI, and may 
be associated with an increased rate of fat as 
well as lean tissue growth. 

Divergent, controlled selection for backfat 
in Duroc and Yorkshire populations resulted in 
greater than 4 SD of total direct phenotypic 
response (Hetzer and Harvey, 1967; Hetzer 
and Miller, 1972a,b). Similar direct responses 
were also reported from two independent stud¬ 
ies in which five generations of downward 
selection for backfat were practised (Gray 
et a/., 1968; Berruecos et a/., 1970). 
Berruecos et al. (1970) reported that there 
was not a correlated change in pig growth 
associated with selection for decreased fat, but 
Hetzer and Miller (1972a) concluded that the 
correlated response in growth rate may vary by 
breed. In Durocs, ADG increased significantly 
in both divergent lines; in Yorkshires, ADG did 
not change significantly in the high fat line, but 
decreased in the low fat line. The variable cor¬ 
related responses in ADG or backfat to direct 
selection on the other trait are consistent with 
the estimates of the genetic correlation between 
the two traits (Table 14.1). 

Although estimates of heritability for feed/ 
gain suggest that selection would be effective 
(Table 14.1), results from selection experi¬ 
ments have been mostly discouraging. 
Dickerson and Grimes (1947) reported that 


divergent selection for feed/gain resulted in 
significant direct response and a correspond¬ 
ing realized heritability of approximately 24%. 
But in two other studies in which downward 
selection for feed/gain was practised (Jungst 
et a/., 1981; Webb and King, 1983), realized 
heritabilities were not significantly different 
from zero. Results of selection for RFI have 
been reported from two independent studies 
(Gilbert et a/., 2007; Cai et a/., 2008). After 

three generations of divergent selection on 
phenotypic estimates of RFI in a Large White 
population (Gilbert et a/., 2007), the estimated 
difference in phenotypic standard deviation 
units between the lines (high to low) was 
approximately 0.3 and 0.2 for RFI and FI, 
respectively. The lines did not differ for weight 
of backfat; growth rates were not reported, but 
the high line had between 0.2 and 0.25 greater 
feed/gain than the low line. Four generations 
of downward selection on estimated breeding 
values for RFI in a similar breed background 
(Yorkshire) resulted in reductions in RFI and FI 
of 96 and 165g daily, respectively, relative to 
an unselected control line (Cai et a/., 2008). 
The selected line grew more slowly (33 g daily) 
but was more efficient (1.36% greater gain/ 
feed) and had less backfat (1.99 mm) than the 
control line. 

Leymaster et al. (1979a,b) reported sig¬ 
nificant direct responses to selection based on 
ultrasonic estimates of either LTGR (weight of 
lean cuts at 160 days of age) or percentage of 
lean carcass (percentage of lean cuts at 81.6 kg 
live weight), but only selection on the estimate 
of LTGR was successful in improving both 
traits simultaneously. While selection for LTGR 
resulted in less carcass fat and greater ADG, 
response to selection for a greater percentage 
of lean carcass was accompanied by decreased 
ADG. These results are consistent with some 
of the estimates of the genetic correlation 
between ADG and backfat presented in Table 
14.1, and with the correlated decrease in ADG 
reported by Berruecos et al. (1970) when 
selection was for less backfat thickness. 

Several experiments have been conducted 
to test responses to selection for a classical 
index of ADG and backfat thickness (e.g. 
Vangen, 1979, 1980a,b; Cleveland et a/., 
1982, 1983a,b, 1988; Fredeen and Mikami, 
1986a,b,c,d,e; McKay, 1990, 1992), and 
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selection was generally successful in improving 
the component traits of ADG and backfat 
thickness. In a rather comprehensively designed 
experiment (Fredeen and Mikami, 1986a), 
index selection improved ADG and backfat as 
much as each component was improved by 
single-trait selection in two contemporary lines. 
However, consistent with the generally antago¬ 
nistic relationship between these components 
observed under ad libitum intake (Table 14.1), 
neither of the single-trait lines was as effective 
as the index at improving overall merit for the 
two traits combined. Only in the experiment 
reported by McKay (1990) was there little 
response in ADG, but that result could be 
explained by the relatively greater selection 
emphasis applied to backfat. 

The most detailed evaluation of correlated 
changes in LTGR and LTFC in response to the 
index selection was reported by Cleveland 
et al. (1983a). In a study of barrows from the 
select and control lines at three levels of feed 
intake, the select line was superior in rate of 
protein growth and feed required per unit of 
edible lean at each of the intake levels. Although 
specific responses in LTGR and LTFC in the 
other studies were not reported, the simultan¬ 
eous improvements in ADG, backfat and total 
body feed conversion reported by Vangen 
(1980a) and by Sather and Fredeen (1978) 
indicate that correlated improvements were 
made. Even with selection against backfat 
thickness, improvements in efficiency were 
achieved without significant reductions in 
genetic potential for FI. 

Selection indexes that included measure¬ 
ments of feed conversion, in addition to ADG 
and backfat, have also been evaluated as a 
means of improving LTFC (McPhee, 1981; 
Ellis et al ., 1988). Both McPhee (1981) and 
Ellis et al. (1988) reported improvements in 
backfat and feed conversion, but not significant 
improvements in ADG, resulting from index 
selection. Both studies also reported that 
reduced ad libitum FI accompanied selection. 
The lack of improvement in ADG was 
explained by the relatively great negative 
emphasis on feed conversion (feed/gain) 
and backfat, and the tendency for an antago¬ 
nistic genetic correlation between backfat and 
ADG, and possibly between feed conversion 
and ADG, as feeding becomes more liberal 


(Table 14.1). Even though LTGR was increased 
by selection on the indexes, the improvements 
in LTFC were primarily through decreased 
appetite. 

There have been a limited number of 
reports of experiments designed to evaluate 
restricted feeding as a performance testing 
environment and to test the theories of Fowler 
et al. (1976) regarding the interactions of 
LTGR and LTFC with testing regime. McPhee 
et al. (1988) reported that selection on an 
index that estimated LTGR in a testing envi¬ 
ronment of scale feeding successfully improved 
ADG, backfat thickness, feed conversion and 
lean ham, and that responses were greater 
when offspring from the line were allowed ad 
libitum intake than when they were restricted. 
Appetite (ad libitum FI) was also increased by 
selection on the index under scale feeding. 
Although the study did not include a line in 
which selection on the index was practised 
with ad libitum FI, the authors speculated that 
response would be greater when testing is with 
scale feeding, because it was under that envi¬ 
ronment that estimated heritabilities were 
greater and the genetic correlation between 
ADG and backfat was favourable (Table 14.2). 

A comprehensive study was conducted in 
Landrace and Large White populations to test 
divergent, controlled selection for either FI, 
LTGR or LTFC in pigs allowed ad libitum feed 
intake and LTGR in pigs with restricted feed 
intake (Cameron, 1994; Cameron and Curran, 
1994a; Cameron et al ., 1994). When the test¬ 
ing environment allowed ad libitum feed intake 
and performance was measured in pigs given 
free access to feed, the greatest amount of 
improvement in LTGR was from direct selec¬ 
tion. In addition, correlated response in LTFC 
from selection on LTGR was similar to (in 
Landrace) or greater than (in Large White) 
direct response from selection on LTFC. 
Correlated response in FI was zero or slightly 
positive from selection on LTGR, but selection 
on LTFC caused a reduction in FI. In the same 
set of studies, progeny from the line in which 
selection was for LTGR in a testing environ¬ 
ment of scale feeding and from the lines in 
which selection was with ad libitum consump¬ 
tion were compared in both feeding environ¬ 
ments to determine the best overall selection 
strategy (Cameron and Curran, 1995a). When 
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pigs from the Large White lines were allowed 
ad libitum intake, ADG was greatest in the 
line selected for high LTGR under scale feed¬ 
ing, followed, respectively, by those selected 
under ad libitum intake for high LTGR and 
improved LTFC. The lines ranked the same for 
FI as for ADG, but were similar to one another 
in backfat and feed conversion. Performance 
in the lines was more similar when progeny 
were scale fed, but pigs from the line selected 
for LTGR with scale feeding still tended to have 
faster gains and less backfat than those from 
the lines in which selection was for LTGR or 
LTFC with ad libitum access. In Landrace 
pigs, response in the same traits was similar in 
the lines selected either for high LTGR or 
improved LTFC with ad libitum intake or for 
high LTGR with scale feeding, regardless of 
whether progeny were allowed ad libitum or 
restricted feed intake. 

Cameron and Curran (1995b) also 
reported results from an evaluation of carcass 
traits in which boar and gilt progeny from the 
three lines (LTGR-ad libitum feeding, LTFC-ad 
libitum feeding and LTGR-scale feeding) were 
tested under the feeding conditions of selection 
in their respective lines. Selection for LTGR-ad 
libitum increased LTGR, but did not change 
rate of fat growth. Rate of fat growth was 
reduced in the line selected for LTFC-ad libi¬ 
tum , but the change in LTGR was not signifi¬ 
cant. Selection for LTGR-scale significantly 
increased LTGR and decreased rate of fat 
growth. 

Factors underlying genetic variation 
in performance traits 

The widespread development and availability 
of laboratory assays for hormonal molecules in 
the pig have provided opportunities for investi¬ 
gations of the physiological factors underlying 
variation in performance traits. In addition to a 
goal of greater understanding of the underlying 
mechanisms of quantitative genetic variation, 
many of these studies were also aimed at the 
discovery of physiological indicator traits that 
could be used to enhance the accuracy of selec¬ 
tion for objectives such as LTFC, for which 
conventional methods are expensive and diffi¬ 
cult to implement (Blair et a/., 1990). Despite 


the inherent challenges in effective design of 
these studies owing to the general complexity 
of growth and development, an extensive body 
of information related to physiological path¬ 
ways underlying these traits has led to studies 
of the somatotrophic axis and putative regula¬ 
tors of appetite. 

Large performance differences between 
breeds were the focus of some of the earliest 
studies in livestock. Buonomo et al. (1987) 
reported that the levels of circulating IGF1 
(insulin-like growth factor 1) in a cross of large, 
fast-growing breeds 1}A Large White x l A 
Landrace x V 2 Duroc) were 24 and 105% 
greater, respectively, than levels in smaller, 
slower growing Yucatan micro and Hanford 
miniature pigs. While results like these proba¬ 
bly reveal factors contributing to the observed 
differences in growth and development, breed 
comparisons are confounded with all genetic 
selection and drift that has occurred in the his¬ 
tory of the populations. Experimental lines in 
which selection has been only for traits associ¬ 
ated with growth and efficiency are free of con¬ 
founding with other selection effects, and may 
reveal more directly indicator traits that can be 
used for within-line selection. 

Lines selected divergently for post-weaning 
ADG (Woltmann et al., 1992) were used as a 
basis for a series of studies to test the potential 
roles of underlying physiological factors. 
Selection had resulted in 46% greater ADG, 
60% greater average FI and 13% greater back¬ 
fat in the fast than in the slow line (Clutter 
et al., 1998a). When serial blood samples were 
collected from gilts representing the fast and 
slow lines at approximately 55 kg body weight, 
there were no significant differences in mean 
GH (growth hormone) concentration or in pul- 
satility of GH secretion, but there was a greater 
mean concentration of IGF1 in the fast than in 
the slow line (Clutter et al., 1995b). In addition 
to greater circulating IGF1, there was less 
measured activity of IGF-binding proteins 
(IGFBP-2 and IGFBP-3) in pigs from the fast 
line, suggesting a greater availability of active 

IGF1. 

Based on results of scale-feeding studies of 
these divergent lines in which most of the 
response in growth rate could be attributed to 
changes in daily feed intake (Woltmann et al., 
1992), the role of cholecystokinin-8 (CCK) as 
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a putative satiety factor was the focus of addi¬ 
tional experiments. When pigs from the lines 
were trained to eat twice a day to appetite and 
serial blood samples were collected during a 
2-h feeding period that followed overnight feed 
deprivation, CCK per unit of feed intake was 
significantly greater in pigs from the slow line 
than in pigs from the fast line (Clutter et al., 
1998a). In a second experiment, pigs sampled 
from the lines were infused with synthetic CCK 
during free access to feed (Clutter et al., 
1995a). Feed intake of pigs from the slow line 
was reduced more than feed intake of pigs 
from the fast line in response to the synthetic 
CCK, suggesting a greater sensitivity to this 
putative satiety signal. 

Genetic lines of pigs resulting from long¬ 
term divergent selection for components of 
LTFC have also been used as a basis for growth 
physiology studies (Cameron and Curran, 
1994b; Cameron et al., 1994). Serum IGF1 
concentrations were compared in lines created 
by allowing pigs ad libitum access to feed and 
selected divergently for either FI or LTGR 
(Cameron et al., 2001). As expected, pigs 
from the line selected for high FI ate more than 
those from the line selected for low FI, and also 
grew faster, had more backfat and were less 
efficient. Figs from the line selected for high 
LTGR grew faster than those from the line 
selected for low LTGR, but did not eat more, 
had less backfat and were more efficient. A sig¬ 
nificant difference was detected in mean IGF1 
concentration at 6 weeks of age in the FI lines 
(158 versus 104pg/l in the high and low lines), 
but the difference declined in measurements 
made at 30 and 90kg. There was not a differ¬ 
ence in mean IGF1 between the LTGR lines at 
6 weeks of age, but the high line had greater 
mean IGF1 than the low line by 30 kg of body 
weight (198 versus 153pg/l in the high and 
low lines). 

Because of their respective putative roles 
in appetite stimulation and inhibition, serum 
levels of neuropeptide Y (NPY) and leptin were 
measured in the FI and LTGR lines, as well as 
in lines created through providing free access 
to feed and selecting divergently for LTFC 
(Cameron et al., 2000, 2003). The FI lines 
did not differ in NPY concentration, but the 
line selected for high FI had a greater level of 
serum leptin than the line selected for low FI. 


The corresponding estimates of line means for 
performance traits were consistent with the 
phenotypic means reported previously by 
Cameron et al. (2001), in that upward selec¬ 
tion in FI resulted in greater feed intake, more 
predicted lipid weight and less efficiency than 
downward selection. Serum concentration of 
NPY was greater in the high LTGR line than 
in the low LTGR line, but leptin levels did 
not differ significantly between the lines. 
Corresponding estimates of line mean differ¬ 
ences were again similar to phenotypic means 
reported by Cameron et al. (2001), in that the 
high line grew faster, had more predicted pro¬ 
tein weight and less predicted lipid weight and 
was more efficient than the low line; but in 
contrast to the earlier report, the high line was 
estimated to eat more per day than the low 
line. A lesser level of serum leptin in the high 
than in the low LTFC line was associated with 
less predicted lipid and more predicted pro¬ 
tein and greater efficiency. Serum NPY was 
greater in the high than in the low LTFC line, 
but daily feed intake did not differ significantly 
between the lines. Correlations of leptin with 
FI and fatness indicated that the responses in 
leptin were largely due to greater fat deposi¬ 
tion rather than to greater FI per se and, over¬ 
all, the results suggest that response to 
selection for greater FI is more likely due in 
part to less sensitivity to leptin rather than less 
leptin production. Serum concentrations of 
NPY were not strongly associated with feed 
intake. 

Correlated response in the circulating level 
of IGF1 at a target age of 35 days (referred to 
as juvenile IGF1) as a result of controlled selec¬ 
tion for RFI was reported by Bunter et al. 
(2010). Serum IGF1 was measured in a total of 
2570 animals sampled from the selected and 
control lines beginning at generation two. 
Genetic correlations were estimated for the 
population between IGF1 and the production 
traits, and genetic trends in the selected and 
control lines through generation five were used 
to estimate realized correlated responses. 
Estimated genetic correlations were significant 
between juvenile IGF1 and backfat, loin muscle 
area, RFI and feed/gain (0.52 ±0.11, -0.35 ± 
0.12, 0.63 ± 0.15 and 0.78 ± 0.14, respec¬ 
tively), but not different from zero between 
IGF1 and lifetime ADG, test ADG and FI 
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(0.06 ± 0.14, -0.19 ± 0.14 and 0.26 ± 0.17, 

respectively). Correlated response in IGF1 to 
downward selection on RFI reached -1.07 SD 
by generation five, corresponding to a realized 
genetic correlation of 0.84. 

Across the range of these experiments, 
significant results emerge that may begin to 
reveal some of the factors that underlie genetic 
variation and contribute to response to selec¬ 
tion for performance traits in pigs. Each of the 
studies of IGF1 demonstrated that selection 
had affected some aspect of the IGF1 pathway, 
but the impact of selection observed probably 
depends on the timing of measurement of 
IGF1 and the selection emphasis on the com¬ 
ponents of post-weaning performance. 
Divergent selection for post-weaning ADG 
appears to have altered the production of IGF1 
and associated binding proteins by several 
weeks after weaning, so that a greater amount 
of active IGF1 may be available in pigs selected 
for fast growth than in pigs selected for slow 
growth (Clutter et a/., 1995b). Greater levels 
of circulating IGF1 were also observed begin¬ 
ning at 6 weeks of age in a line selected for 
greater feed intake than in a line selected for 
less feed intake (Cameron et a/., 2001). A sim¬ 
ilar positive genetic relationship of IGF1 with 
feed intake and feed/gain was observed when 
selection was for less RFI, but the estimated 
genetic relationships between IGF1 and ADG 
were not significant even though the selection 
response in RFI was accompanied by a slight 
decrease in growth rate (Bunter et ai, 2010). 
The likely complexity in the physiological 
mechanisms that determine overall post- 
weaning performance begins to become even 
more apparent when considering that mean 
IGF1 at 30 kg body weight was greater in the 
divergent line that was selected for high LGA 
(lean growth rate with ad libitum access to 
feed), and also superior for total body gain and 
efficiency but not different from the low-LGA 
line for FI (Cameron et ai, 2001). 

Selection that resulted in relatively greater 
appetite may have both increased the amount 
of CCK circulating in response to feed intake 
and the sensitivity to the CCK satiety signal, 
the latter suggesting a change in CCK receptor 
characteristics (Clutter et ai, 1995a, 1998a). 
Response to selection for FI and the correlated 
response in fat deposition were not due to 


insufficient leptin; therefore a decline in sensi¬ 
tivity to leptin may have contributed (Cameron 
et ai, 2000). There was no evidence that 
genetic divergence in energy intake was due to 
increased appetite via greater circulating NFY 
(Cameron et al ., 2003). 

Each of the selection lines discussed here 
originated from a single base population and 
was subjected to only a single selection criter¬ 
ion (post-weaning ADG, FI, lean tissue gain, 
lean efficiency or RFI) after establishment of 
the line. Consequently, physiological differ¬ 
ences between the lines can be attributed to 
founder effects that may have occurred by 
chance in the derivation of the lines from the 
base population, random drift in gene fre¬ 
quency due to non-infinite line size and chance, 
or the selection applied. Although the effects 
of these factors are confounded in the com¬ 
parisons made, there was clearly intense 
single-trait selection achieved in these experi¬ 
ments, and it is likely that the physiological 
differences observed are associated with the 
divergent genetic potential of the lines for FI, 
growth and tissue deposition. 

Developments in tools related to molecu¬ 
lar biology over the last two decades have 
allowed the extension of these types of studies 
to exploration for the genes or genomic regions 
that underlie variation in performance traits. 
Two distinct, but parallel, approaches that have 
been used in this exploration are comprehen¬ 
sive scans of the genome and the study of spe¬ 
cific candidate genes. 

A large number of genome scans for 
quantitative trait loci (QTLs) contributing to 
variation in performance traits in pigs have 
been reported, beginning with a few coarse 
scans using microsatellite markers in crosses of 
divergent lines, and accelerating in number as 
dense marker arrays have become available. 
The Fig QTL data base (PigQTLdb, at http:// 
www.genome.iastate.edu/cgi-bin/QTLdb/ 
SS/index) contains information for porcine 
QTLs reported in the literature. In Release 
10 of the Pig QTLdb (30 December 2009) 
there are 429 QTLs for body weight or gain, 
590 QTLs for leanness traits of the live pig or 
overall carcass composition and 65 QTLs for 
feed intake or feed conversion. 

Some studies of candidate genes in 
pigs have been based on knowledge of how 
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physiological pathways may contribute to var¬ 
iation in performance traits. A polymorphism 
in the porcine gene encoding the cholecysto- 
kinin-A receptor (CCKAR; Clutter et a/., 
1998b) was targeted because of evidence that 
variation in sensitivity to the CCK satiety sig¬ 
nal may contribute to variation in feed intake 
and growth in pigs (Clutter et a/., 1995a, 
1998a). Houston et al. (2006) conducted 
association studies of the polymorphism in 
CCKAR reported by Clutter et al. (1998b) 
and of a second polymorphism in the 5' 
untranslated region (5' UTR) of CCKAR that 
they had determined disrupts binding of the 
YY1 transcription factor. The association 
analyses were based on data from a Meishan x 
Large White F2 cross, from the divergent 
selection lines of Large White reported by 
Cameron and Curran (1994b) and from seg¬ 
regating commercial lines. Although the two 
markers were in strong linkage disequilibrium 
and were generally associated with perform¬ 
ance traits across these populations, the 
5'-UTR polymorphism was strongly and con¬ 
sistently associated with both feed intake and 
growth rate. 

Other candidate gene studies in pigs have 
been initiated because of mutations reported in 
model species with large effects on traits related 
to pork performance traits. Kim et al. (2000) 
began investigation of the melanocortin-4 
receptor gene ( MC4R ) in pigs after reports of 
the effects of a mutation in Mc4r in mice that 
resulted in large increases in feed intake and 
obesity, and demonstrated that a missense 
mutation (p.Asp298Asn) was associated with 
fatness and daily gain. At a more basic level, 
p.Asp298Asn displayed stimulatory effects on 
cAMP production (Kim et al ., 2004), providing 
evidence that it may directly cause variation in 
these traits. Subsequent studies of p.Asp298Asn 
and other variants within porcine MC4R across 
multiple lines of pigs have revealed generally 
consistent associations with body composition 
and growth, but also that specific mutations 
within the gene may be more or less strongly 
associated with performance traits in individual 
lines (Fan et a/., 2009). 

The MC4R polymorphism (Kim et al., 
2000) and the 5'-UTR polymorphism in 
CCKAR (Houston et al., 2006) are each 
examples of candidate genes being used in 


commercial pork production as tools by which 
to increase accuracy of selection for post- 
weaning performance traits. There have been 
many other reports of significant associations 
between markers in candidate genes and per¬ 
formance traits in pigs, and discoveries from 
comprehensive genome scans have begun to 
be developed and implemented in marker- 
assisted selection programmes designed to 
account for significant proportions of the addi¬ 
tive variance in performance traits (Dekkers, 
2004). As sequencing of the porcine genome 
is completed, the resolution of our knowledge 
of the basic factors contributing to genetic vari¬ 
ation in performance traits will continue to 
improve and, along with it, the potential for 
increasingly powerful tools for selection and 
improvement. 

Implications 

Great opportunities exist for the genetic 
improvement of post-weaning performance 
through within-line selection. Various selection 
experiments have shown that predicted 
responses in components of LTGR and LTFC 
based on models of growth (Fowler et al., 
1976) are remarkably accurate. In testing envi¬ 
ronments of ad libitum intake, selection for 
LTFC was successful primarily through reduc¬ 
tion in appetite accompanied by a lesser rate of 
fat growth, rather than through increased 
LTGR. Selection for LTGR by testing with ad 
libitum intake improved LTGR and LTFC with¬ 
out reducing appetite, but did not reduce the 
rate of fat growth. Implementation of scale-fed 
testing to select for LTGR seems to combine 
the best features of selection for LTGR and 
LTFC under ad libitum intake, and may result 
in market pigs that are the most valuable under 
either commercial feeding environment (Fowler 

et al., 1976; Cameron and Curran, 1995b). 

The potential benefit of any physiological 
factor as an indicator trait in selection for post- 
weaning performance is a function of the herit- 
ability of the indicator and the genetic 
correlation with the component(s) of post- 
weaning performance to be improved, relative 
to the expected response to selection without 
the indicator trait, i.e. the marginal improve¬ 
ment in accuracy. Ultimately, decisions to 



342 


A.C. Clutter 


implement physiological indicators in commer¬ 
cial selection programmes will be determined 
by not only the confirmed genetic associations 
of those indicators with components of breed¬ 
ing objectives for post-weaning performance 
(e.g. components of LTFC) within targeted 
populations, but also analyses of the cost of 
sample collection and assay completion rela¬ 
tive to the marginal increase in accuracy and 
response. In addition, the possible opportunity 
cost or benefit of using physiological markers 
versus emerging genomics technologies to 
increase accuracy of selection will need to be 
assessed. Detailed updates and discussion of 
genomic approaches to selection in the pig are 
presented in Chapters 8 and 16. 


Genetic Relationships of Growth 
Performance with Reproduction 

To determine the optimum emphasis on lean 
gain and efficiency in selection objectives for 
each line contributing to a commercial breed¬ 
ing system, the genetic relationships (correla¬ 
tions) of the components of LTGR and LTFC 
(i.e. ADG, body composition and feed conver¬ 
sion) with traits of reproductive performance 
must be known. Expectations for changes in 
reproductive performance from selection for 
market animal performance can be derived 
from estimates of genetic correlations between 
the relevant traits and from estimates of corre¬ 
lated responses in selection experiments. 


Estimates of genetic correlations 

Two publications in 1981 included summaries 
of estimated genetic correlations between post- 
weaning performance and traits of puberty or 
reproduction based on covariances among rel¬ 
atives (Hutchens and Hintz, 1981; Johansson, 
1981). A review by Brien (1986) also included 
a discussion of literature estimates of genetic 
relationships between growth and reproduc¬ 
tion in the pig and other mammalian species. 
Many of the early reported estimates of genetic 
correlations that follow were included in more 
detail in those summaries. Vogt et al. (1963) 
first reported that the genetic correlation 


between post-weaning growth rate and litter 
size was near zero, but in subsequent studies in 
which first and second parities were considered 
separately (Morris, 1975; Johannson, 1981), 
the correlation tended to be positive for second 
litters. Conversely, in a study of more modern 
populations, Holm et al. (2004) reported that, 
in Norwegian Landrace pigs born between 
January 1990 and January 2000, there was 
an unfavourable genetic correlation between 
age at 100 kg and number born alive at first 

and second parities (0.60 ± 0.05 and 0.42 ± 
0.06, respectively). Although estimated genetic 
correlations between production and reproduc¬ 
tion traits were generally small in a study of 
Czech Landrace and the Slovak White Meaty 
breeds (Pekoviova et al., 2002), the estimate 
of greatest magnitude suggested an unfavour¬ 
able relationship between ADG and number 
born alive in parity two and greater (-0.12 and 
-0.28 in the two breeds, respectively). Serenius 
et al. (2004) reported in Finnish Landrace and 
Large White populations a tendency for an 
unfavourable genetic correlation of ADG with 
total number born (-0.16 ± 0.13), but favour¬ 
able relationships of ADG with number of still¬ 
born piglets (-0.25 ± 0.15) and piglet loss 
during the suckling period (-0.43 ± 0.16). 
Young et al. (1977) were unable to estimate 
genetic correlations between ADG and litter 
size because of a negative estimate of sire vari¬ 
ance. However, they reported a positive genetic 
correlation between ADG and ovulation rate, 
and a tendency for a negative relationship 
between ADG and embryo survival (number of 
ovulations/number of embryos). These results 
for ADG and ovulation rate seem to be consist¬ 
ent with the results of selection for ovulation 
rate in the pig (Rosendo et al., 2007) and the 
positive genetic correlation between body size 
and ovulation rate observed in other species 
(Brien, 1986). 

Morris (1975) reported an unfavourable 
genetic correlation between killing-out percent¬ 
age and litter size, but genetic correlations 
between backfat thickness and first and second 
parity litter sizes that were (negative) favourable 
and generally small to moderate in magnitude. 
Bereskin (1984) also reported a favourable 
correlation between backfat and litter size, as 
well as a favourable genetic correlation between 
loin muscle area and litter size. However, the 
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associated standard errors reported by Morris 
(1975) and Bereskin (1984) indicate that the 
estimates were not significantly different from 
zero. Johansson and Kennedy (1983) reported 
some tendency for unfavourable genetic cor¬ 
relations between backfat and litter size, but 
standard errors were not included and the esti¬ 
mates of unfavourable relationships (r g = 0.13 
to 0.22 for BF and number born alive) were 
probably not different from zero. More recent 
estimates of these genetic correlations have 
consistently revealed at least a tendency for an 
unfavourable genetic relationship of leanness 
with litter traits. Holm et a/. (2004) reported 
that backfat thickness was uncorrelated with 
number born alive, but estimated that the 
genetic correlations of lean meat content with 
number born alive at the first and second pari¬ 
ties were -0.12 ± 0.07 and -0.24 ± 0.09, 
respectively. Estimated genetic correlations of 
fat percentage with total number born and 
number of stillborns were 0.19 ± 0.11 and 
-0.34 ± 0.13, respectively, in Finnish Large 
White pigs (Serenius et a/., 2004); estimates of 
genetic correlations of lean percentage with 
the same two litter traits were -0.35 ± 0.10 
and 0.18 ± 0.14, respectively. Imboonta et a\. 
(2007) reported unfavourable correlations of 
backfat with total number born and number of 

stillborns (0.14 ± 0.08 and -0.22 ± 0.11, 
respectively). 

The estimated genetic correlation between 
first litter size and feed conversion ratio was near 
zero in independent studies reported by Morris 
(1975) and Johansson (1981), but in both stud¬ 
ies the correlation between second litter size and 
feed conversion tended to be favourable. 
Bereskin (1984) reported a tendency for 
favourable genetic correlation between first lit¬ 
ter size and feed conversion. However, based 
on the difference between sire and dam com¬ 
ponents of variance for feed conversion, 
Dickerson (1947) and Dickerson and Grimes 
(1947) concluded that superior genetic merit 
for efficiency was associated with poorer milk¬ 
ing ability of the sow. In more modern popula¬ 
tions of maternal breeds, the estimated genetic 
associations between litter traits and feed con¬ 
version have tended to be mostly unfavourable. 
Holm et al. (2004) reported genetic correla¬ 
tions of 0.23 ± 0.08 and 0.20 ± 0.10 for 
feed/gain with number born alive at each of 


the first two parities in Norwegian Landrace. 
The sign of the estimated correlation of feed/ 
gain with total number born was unfavourable 
in Finnish Landrace and Large White (0.20 ± 
0.13; Serenius et a/., 2004), but there was a 
tendency for a desirable association between 
feed/gain and number of piglets lost during 
suckling (0.42 ± 0.17). 

In attempts to draw overall conclusions 
regarding the genetic relationship between 
post-weaning performance and litter charac¬ 
teristics, some researchers have estimated 
genetic correlations between indexes of the rel¬ 
evant traits. Bereskin (1984) reported that 
genetic correlations of a performance index of 
ADG and backfat thickness with two indexes of 
sow productivity (litter sizes and weights) were 
not different from zero (average estimate 
= 0.07), indicating that selection for lean gain 
would not adversely affect litter characteristics. 
Those results support the findings of Morris 
(1975) that litter sizes and litter weight were 
not genetically correlated with a total point 
score that included ADG, feed conversion and 
carcass traits (estimates ranged from -0.04 to 
0.02). However, in a more recent evaluation of 
field data from four US swine breeds, Chen 
et al. (2003) reported that, along with an unfa¬ 
vourable genetic correlation between number 
born alive and backfat (average estimate = 
0.188), the genetic correlation between 
number born alive and lean gain was also 
slightly, but significantly, unfavourable (average 
estimate = -0.095). Similarly, Arango et al. 
(2005) concluded from parameter estimates in 
commercial Large White sows that intense 
selection for both faster growth and greater 
leanness would result in increased pre-weaning 
mortality in piglets born to first-parity sows. 

It is more difficult to find estimates of the 
genetic relationships between performance 
traits and traits of reproduction that are not 
routinely measured. Favourable genetic corre¬ 
lations between ADG and age of puberty have 
been reported by Young et al. (1978) and by 
Hutchens et al. (1981). Results from these 
same studies indicated that the genetic correla¬ 
tion between backfat thickness and age of 
puberty was either favourable or not different 
from zero, suggesting that selection for lean 
gain would not be detrimental to age of puberty. 
Rydhmer et al. (1994) reported a tendency for 
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an unfavourable genetic correlation between 
lean growth and age of puberty; their estimates 
of genetic correlations from a population of 
Swedish Yorkshires selected for lean tissue 
growth rate were 0.40 (± 0.10) between lean 
percentage and age of puberty, and 0.21 
(± 0.23) between lean gain and age of puberty. 
But Holm et al. (2004) estimated favourable 
genetic correlations of age at 100 kg with age 
at first service (0.68 ± 0.02), interval from first 
weaning to next service (0.16 ± 0.07) and 
interval from second weaning to next service 
(0.20 ± 0.10), and correlations of approxi¬ 
mately zero between lean meat content and 
these same breeding traits. 

Correlated responses to selection 

Many studies of selection for post-weaning per¬ 
formance have included measurements of those 
traits of reproduction that are routinely avail¬ 
able in the production system, primarily litter 
sizes and litter weights from birth to weaning. In 
a few studies, traits such as age of puberty, 
breeding performance and sow weight changes 
have also been recorded. Unfortunately, data 
on reproductive performance in experimental 
populations selected for traits of market pig 
performance typically have several limitations: 

1. Data that are typically only from gilts and 
first litters. As noted earlier, correlations 
between performance and reproduction may 
differ by parity. If selection for lean gain and 
efficiency affects reproductive ability, it is pos¬ 
sible that the effects are manifested after the 
gilt weans her first litter. 

2. Measurements of correlated responses are 
often based on the reproduction of selected 
females, and consequently are subject to bias. 

3. Many important traits of reproduction are 
quite variable and inherently difficult to meas¬ 
ure with precision. Thus, data collected from 
the relatively small number of females main¬ 
taining a given selection line in each genera¬ 
tion are usually not adequate to allow 
meaningful conclusions. 

4. Most experimental selection lines of pigs 
are not replicated and are of relatively small 
effective size. Therefore, measurements of cor¬ 
related (and direct) responses to selection are 


likely to be confounded with random drift in 
gene frequencies. 

Among experiments in which selection 
was for some measurement of growth, signifi¬ 
cant changes in litter weights or pig weaning 
weights reflecting changes in the pig’s direct 
genetic merit for pre-weaning growth and/or 
the maternal ability of the sow were only 
observed by Craig et al. (1956) with selection 
for weight at a post-weaning age, and by 
Garnett and Rahnefeld (1976) with selection 
for ADG. In both cases, a positive impact of 
upward selection for growth was indicated. 
Although trends in litter sizes were not signifi¬ 
cant in most of the studies in which selection 
was for growth, a positive effect of selection 
for weight at 70 days of age on 21-day litter 
size was reported by Kuhlers and Jungst 
(1992a). Conversely, selection for weight at 
200 days of age in Durocs decreased total litter 
size at birth (Kuhlers and Jungst, 1992b), and 
pre-weaning mortality increased due to selec¬ 
tion for post-weaning ADG (Garnett and 
Rahnefeld, 1976). Number born alive did not 
change significantly in the former study 
(Kuhlers and Jungst, 1992b). 

Litter traits did not change significantly 
over ten generations of divergent selection for 
ADG (Clutter and Buchanan, 1998), but cor¬ 
related response in appetite seems to be 
reflected in weight changes of gilts during ges¬ 
tation and first lactation. Gilts from the line 
selected for slow growth and expressing rela¬ 
tively less appetite weighed less at farrowing, 
but lost a significantly greater amount of weight 
during lactation than gilts from the line selected 
for fast growth. Although size and weight of the 
first litter were not affected, weight loss in the 
female was great enough to suggest that future 
reproduction might be inhibited. Unfortunately, 
females were marketed after the first litter, 
hence re-breeding performance and subsequent 
litter characteristics were not measured. 

For studies in which selection was for 
divergent or decreased backfat thickness, only 
Berruecos et al. (1970) reported significant 
changes in reproduction. Selection for 
increased leanness resulted in a significant 
decline in litter size at birth and weaning but, 
surprisingly, by 130 days of age the difference 
in litter size was nearly gone. Although, based 
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on the difference of sire and dam components 
of variance for feed conversion, Dickerson and 
Grimes (1947) concluded that females with 
greater efficiency for total body weight gain 
expressed poorer nursing ability, correlated 
responses in reproduction were not reported 
from studies in which single-trait selection for 
feed conversion was practised. 

As with most of the experiments imple¬ 
menting single-trait selection, many of the stud¬ 
ies of direct or alternative criteria to improve 
LTGR or LTFC revealed no significant trends in 
traits of reproduction. However, some excep¬ 
tions suggest the potential for antagonistic rela¬ 
tionships. Although the regressions of first 
parity litter sizes on generation number in a line 
selected for weight of lean cuts at 160 days 
(LTGR) were negative but not different from 
zero (DeNise et a/., 1983), second parity litter 
sizes and corresponding litter weights were sig¬ 
nificantly decreased by selection. Second parity 
litter characteristics were also significantly 
decreased by selection for percentage of lean 
cuts at approximately 80 kg body weight. Kerr 
and Cameron (1995) did not detect correlated 
responses in litter traits to selection for either 
LTGR with ad libitum intake or LTGR with 
scale feeding, but litter sizes and weights at birth 
and weaning were significantly decreased by 
selection for LTFC with ad libitum intake. This 
result is consistent with their estimate of posi¬ 
tive genetic correlation between litter size traits 
and FI (Kerr and Cameron, 1996), given the 
decrease in FI when selection was for LTFC 
with ad libitum intake. 

Knowledge of the genetic relationships 
between performance and reproduction traits 
may also be gained by measuring correlated 
responses in performance traits to selection for 
reproduction. Petry et al. (2004) summarized 
correlated responses in a line selected for 11 
generations on an index of ovulation rate and 
embryonic survival, then selected through to 
generation 19 for number of fully formed pig¬ 
lets born (Nebraska index line). Direct genetic 
effects for backfat at 88.2 kg and days to 88.2 kg 
did not differ between the index and control 
lines, and even though index line pigs had a loin 
muscle area that was 1.58 cm 2 less than control 
line pigs, they concluded that the methods of 
selection for increased litter size in this line had 
little impact on performance traits. Two other 


studies were of lines that were derived at earlier 
points in time from the Nebraska index and 
control lines. Ruiz-Flores and Johnson (2001) 
reported that an increase in backfat accompa¬ 
nied eight generations of two-stage selection for 
ovulation rate and number of fully formed pig¬ 
lets born; the regressions of mean breeding 
value for backfat deviated from the control were 
0.14 ± 0.06 and 0.34 ± 0.06mm thickness/ 
generation, respectively, for selection lines 
derived from the Nebraska index and control 
lines. Correlated response in weight at 178 days 
of age was different from zero only in the select 
line derived from the Nebraska index line (1.31 
± 0.20kg/generation). Holl and Robison (2003) 
summarized results from nine generations of 
controlled selection for number born alive using 
lines derived from the Nebraska control line. 
Correlated response in breeding value for back¬ 
fat was not different from zero, but for days to 
104kg it was unfavourable (0.93 ± 0.21 days/ 
generation). In an independent study, Estany 
et al. (2002) reported selection for number born 
alive that generated a genetic superiority in the 
select line of 0.46 live piglets relative to an unse¬ 
lected control. As a correlated result, the select 
line had a greater genetic mean for backfat at 
165 days of age than the control line (1.26 ± 
0.23mm from a midline ultrasound). Overall 
growth and feed efficiency did not differ, but the 
patterns varied during the post-weaning period 
(the select line grew faster and was more effi¬ 
cient early, but grew more slowly and was less 
efficient late in the period). 

Implications 

Available estimates of genetic correlations and 
correlated responses to experimental selection 
present in total a complex and somewhat 
inconsistent picture of the genetic relationships 
between performance and reproduction traits. 
There are several technical factors that proba¬ 
bly contribute to this variability in results. 
Estimation of genetic correlations from covari¬ 
ances among relatives is typically associated 
with large standard errors owing to insufficient 
sample size, although some more recent stud¬ 
ies have made use of field records to increase 
precision. The selection experiments discussed 
here encompass a variety of criteria, objectives 
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and methods. Even among those studies of a 
classical index including ADG and backfat, 
selection varied owing to differences in specific 
index weightings, selection differentials achieved 
and the population in which selection was 
applied. Based on these factors alone, it is not 
surprising that results for correlated changes in 
reproduction traits are mixed. In addition, it is 
difficult to generate the number of observations 
necessary for powerful tests of correlated 
responses in these inherently variable traits of 
reproduction. This lack of precision adds to the 
inconsistency in results and can lead to precari¬ 
ous interpretation of non-significant trends. 

Those differences and challenges associ¬ 
ated with experimental designs undoubtedly 
contribute to the fact that, across much of the 
entire range of seven decades of research results 
reviewed, there are reports of favourable or 
neutral (e.g. Vogt et al. 1963; Morris, 1975; 
Bereskin, 1984; Petry et a/., 2004) and of unfa¬ 
vourable (e.g. Dickerson, 1947; Kerr and 
Cameron, 1995; Imboonta et a/., 2007) 
expected responses in reproduction to selection 
for post-weaning performance traits. Even so, it 
should be possible to draw some conclusions 
regarding the genetic relationship of these trait 
groups as it relates to breeding strategies. 

Several studies reported significant 
responses in components of LTGR or LTFC 
with small or non-significant changes observed 
in traits of reproduction, but some clearly unfa¬ 
vourable estimates of genetic correlations or 
correlated responses were reported. While 
some of the antagonism reported was between 
growth rate and reproduction (Pekoviova etal., 
2002; Holm et a/., 2004; Serenius et a/., 
2004), the potential for adverse effects seems 
most likely when selection is for increased lean¬ 
ness (Berruecos et a/., 1970; DeNise et a/., 
1983; Serenius et a/., 2004; Imboonta et a/., 
2007), LTGR (Chen et a/., 2003; Arango 
et a/., 2005) or LTFC (Kerr and Cameron, 
1995). These adverse effects tended to be 
more significant for second parity litters 
(DeNise et a/., 1983; Holm et al., 2004), con¬ 
sistent with relatively recent reports that similar 
unfavourable associations may exist with sow 
lifetime performance (Clutter, 2009). In addi¬ 
tion, potentially antagonistic associations 
between performance and reproduction traits 
were somewhat more frequent in studies of 


more modern populations of pigs. It is possible 
that genetic correlations between components 
of LTFC and reproduction are not linear, 
and that antagonistic relationships may 
develop as performance reaches new thresh¬ 
olds. This may be particularly true for selection 
approaches that take body fat to extremely low 
levels and/or depress ad libitum energy intake. 
Thus, while the literature does not provide con¬ 
clusive evidence that selection for performance 
traits will have detrimental effects on reproduc¬ 
tion in all populations, antagonistic relation¬ 
ships or negative effects in some studies are 
causes for concern. Breeding strategies and 
objectives for maternal lines should be based 
on covariances between components of LTFC 
and, at least, litter traits, estimated specifically 
for the targeted population. 


Non-additive Genetic Effects 
and Breeding Systems 

The most efficient commercial production of 
pork occurs in a breeding system that opti¬ 
mizes selection on additive genetic effects for 
market pig performance and for reproduction 
in the contributing lines, and maximizes exploi¬ 
tation of non-additive genetic effects (domi¬ 
nance and epistasis) through heterosis. To 
determine the efficiency of a breeding system 
from an overall industry perspective, however, 
the cost of maintaining the pure-line and multi¬ 
plier herds that supply the commercial system 
must also be considered. In this section, esti¬ 
mates of maximum heterosis available for traits 
of post-weaning performance are briefly 
reviewed, and effective breeding systems for 
the pork industry are discussed. 

Heterosis estimates for post-weaning per¬ 
formance traits are included in early reviews of 
the extensive research projects that became the 
basis for the initiation of commercial crossbreed¬ 
ing in pigs (e.g. Sellier, 1976; Johnson, 1980, 
1981; Buchanan, 1987), as well as reported 
from additional more recent studies of crosses in 
experimental herds (e.g. McLaren et al., 1987; 
Baas et al., 1992; Cassady et al., 2002). 
Individual, maternal and paternal heterosis, the 
difference in performance due to a crossbred ver¬ 
sus purebred individual, dam or sire, respectively, 
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has been considered. Consistent with theoretical 
expectations, heterotic effects are greatest for 
those traits that have relatively low heritability, 
such as litter sizes and pre-weaning litter weights. 
Accordingly, for components of post-weaning 
performance that are generally moderately herit¬ 
able (see Tables 14.1 and 14.2), relatively moder¬ 
ate to small proportions of heterosis have been 
reported. 

Average estimates of individual heterosis for 
post-weaning ADG reported in the early reviews 
ranged from 6.0% (Sellier, 1976) to 8.8% 
(Johnson, 1981) of the respective purebred 
mean. The estimated percentage of individual 
heterosis for post-weaning ADG averaged across 
more recently reported experiments (McLaren 
et a/., 1987; Baas et a/., 1992; Cassady et al., 
2002) was only slightly greater (10.5%). 
Favourable individual heterosis was reported in 
the reviews by Sellier (1976) and Johnson 
(1981): average estimates of -3.0% for feed/ 
gain and 5.9% for gain/feed, respectively. 
However, Cassady et al. (2002) reported that 
individual heterosis effects for average FI adjusted 
for ADG ranged from 0.10 to 0.15kg/day in 
two separate experiments. A slight unfavourable 
individual heterosis may exist for backfat. 
Johnson (1981) concluded that this unfavourable 
tendency for carcass traits was generally not sig¬ 
nificantly different from zero, a conclusion sup¬ 
ported by the results of Cassady et al. (2002), 
but both McLaren et al. (1987) and Baas et al. 
(1992) reported estimates of direct heterosis for 
backfat that were significantly different from zero 
(3.2% and 7.2%, respectively). As Sellier (1976) 
points out, the amount of heterosis for perform¬ 
ance traits, especially feed conversion, may 
depend on the feeding system (restricted versus 
ad libitum). Effects of maternal heterosis 
were generally small and unimportant for post- 
weaning performance (Johnson, 1981; Cassady 
et al ., 2002). Paternal heterosis, i.e. the advan¬ 
tage of a crossbred boar, may be important for 
the breeding efficiency of young boars, but was 
also negligible for post-weaning performance 
(Buchanan, 1987). 

All three types of heterosis (individual, 
maternal and paternal) can be maximized in a 
static (terminal) cross of FI sires and dams of 
unrelated breed background (e.g. AB sires x CD 
dams). Static crosses also offer the best oppor¬ 
tunity to match complementary strengths of 


lines as sires and dams to fill the paternal and 
maternal roles in the breeding system. As 
described by Sellier (1976), differences in mater¬ 
nal effects are the basis for most of the benefit 
from complementarity in a breeding system, 
and the appropriate choice of a maternal line is 
most obvious for traits of reproduction. However, 
reported differences in feed efficiency (Kuhlers 
et al ., 1972; Johnson et al ., 1973) and carcass 
composition (Bereskin et al., 1971; Johnson 
et al., 1973) between reciprocal crosses indi¬ 
cate that complementarity may also affect post- 
weaning performance traits in the system. 

Fortunately, the static cross is also the 
highest ranking breeding system for pork pro¬ 
duction in terms of overall industry efficiency 
(Dickerson, 1973). The reproductive rate of 
the pig results in a relatively small proportion 
of herds needed for seedstock and multiplier 
production to maintain the static commercial 
cross and, thus, a large proportion of animals 
that realize the maximum benefits of heterosis 
and complementarity. This breeding design is 
also compatible with the biosecurity require¬ 
ments of modern pork production and amen¬ 
able to scenarios in which commercial 
multipliers are populated at the top of the pyr¬ 
amid, and subsequently closed to the introduc¬ 
tions of outside animals (e.g. Clutter, 2009). 

In some situations, the rate of genetic 
improvement in the industry can be maximized 
by implementing specialized selection objec¬ 
tives for each line based on its intended role in 
the breeding system. The objective in paternal 
lines might focus exclusively on LTFC. Because 
maternal lines not only produce the commer¬ 
cial sow herd, but also contribute half the genes 
of market pigs, maternal line selection object¬ 
ives must give relatively equal emphasis to 
LTFC and traits of reproduction. Smith (1964) 
concluded that the pursuit of specialized object¬ 
ives in sire and dam lines would result in at 
least as much overall genetic improvement as 
selection on a general objective, and signifi¬ 
cantly greater improvement if genetic antago¬ 
nism exists between market pig performance 
and traits of reproduction. As discussed in the 
section on the genetic relationships of growth 
performance with reproduction, there is evi¬ 
dence in at least some modern pig populations 
of significant genetic antagonism between pro¬ 
duction and reproduction traits. 
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Conclusions 

The future competitiveness of pork in the food 
market depends on continued genetic improve¬ 
ment in the efficiency of quality lean produc¬ 
tion. From the perspective of post-weaning 
performance of the market pig, great opportu¬ 
nities exist for improvement through selection 
within seed-stock lines. Market pig perform¬ 
ance can be characterized by several compo¬ 
nent traits that respond to selection, but the 
biological index LTFC has been proposed as 
the most appropriate expression of the indus¬ 
try’s objective for this phase of production. 

Investigations of models describing tissue 
growth have yielded strategies for perform¬ 
ance testing and selection to improve LTFC. 
Results from selection experiments have been 
reported which support the hypothesis that 
LTGR and LTFC are perfectly correlated in a 
time-based scale-feeding system, and that 
direct response to selection for LTGR (LTFC) 
among scale-fed candidates will be through 
increased LTGR and decreased fat growth 
without depressed ad libitum feed intake. 
Thus, this or similar testing and selection 
schemes may be effective for seed-stock lines 
supplying commercial systems that allow either 
ad libitum or restricted feed intake. 

Knowledge of the genetic relationships of 
LTFC with the animal’s ability to reproduce is 
required to determine the optimum selection 
emphasis for LTFC in the various lines of a 
breeding system. Although significant correlated 
responses in reproductive traits have not been 
detected in some studies of selection for post- 
weaning performance, detrimental effects on 
reproduction have also been reported. The great¬ 
est risk for negative effects on reproduction from 
selection for LTFC probably exists for approaches 
that result in increased leanness through 


decreased genetic potential for ad libitum feed 
intake, and unfavourable effects may become 
more evident after the first parity and reflected 
in the lifetime performance of the sow. 

This chapter has focused on the genetics of 
efficiency in lean tissue growth. The economic 
importance of LTFC is largely driven by the 
modern consumer’s demand for lean cuts of 
meat, but the quality of tissue in the final prod¬ 
uct is also of increasing importance. As a result, 
the genetics of tissue quality is an area of great 
interest (see Chapter 15), and genetic correla¬ 
tions of components of LTFC with quality char¬ 
acteristics must be well understood in order to 
effectively design breeding programmes. 

As an extension to the information 
described herein regarding physiological varia¬ 
tion contributing to performance traits, the 
completion of sequencing of the porcine 
genome and dense-marker association studies 
of performance traits will continue to reveal 
greater detail of the underlying genetic architec¬ 
ture and provide tools for enhancing the accu¬ 
racy of selection (see Chapters 8 and 16). 
Consequently, methods of selection for post- 
weaning performance will continue to evolve to 
optimize the use of genomic information and in 
response to changing global market demands. 
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Introduction 

Traditional quantitative methods have been 
used successfully in the genetic improvement 
of traits economically important for producers, 
including growth rate, feed conversion and 
backfat/carcass leanness. All these traits are 
relatively easy to measure on the live selection 
candidates, and the genetic response is large 


owing to relatively moderate-to-high heritabili- 
ties. While emphasis on production efficiency 
and lean growth continues to be critical for suc¬ 
cess, other traits related to processing func¬ 
tionality and consumer satisfaction are of 
growing importance. If carcass weight, lean¬ 
ness, proportion of quality cuts and processing 
yields are traits important for processors, con¬ 
sumers are interested in eating-quality traits 
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such as juiciness, tenderness, flavour and col¬ 
our. As a result, meat quality has become 
increasingly important for the pork industry, 
and the processors offer branded pork prod¬ 
ucts that are associated with high quality (Huff- 
Lonergan et a/., 1998). The high demand for 
these products triggered a wave of research 
from genetics to meat science to improve meat 
quality and to reduce the production of low- 
quality pork, such as pale, soft and exudative 
(PSE) and red, soft and exudative (RSE) meat. 
Selection, feeding, animal husbandry and 
processing are the major traditional methods 
available for increasing the value of pork qual¬ 
ity (Barbut et a/., 2008). In the last decade, 
advances in molecular genetics and computa¬ 
tional biology, enhanced by the recent avail¬ 
ability of the swine genome sequence, opened 
exciting possibilities for dissecting and under¬ 
standing the genetic factors that influence pork 
quality. These new tools are of particular 
importance because they can be applied to 
generate information about selection candi¬ 
dates early in life without slaughtering the ani¬ 
mals. Several major functional polymorphisms 
affecting a number of traits measured on 
slaughtered animals, such as post-mortem pH, 
colour and tenderness were also identified, and 
are currently used in selection. The availability 
of the next-generation methods for sequenc¬ 
ing, genotyping and profiling gene expression 
significantly influence our ability to identify, 
understand and quantify the expression of 
genes and networks responsible for the varia¬ 
tion of complex quantitative traits. The pur¬ 
pose of this chapter is to review current 
knowledge and the latest progress in the under¬ 
standing of genetic influences on phenotypic 
variation of meat and carcass quality traits. 


Traits that Characterize Meat Quality 
and Carcass Composition 

Skeletal muscle is a unique and complex tissue 
composed mainly of myofibres with various 
contractile and metabolic characteristics. The 
main role of the muscle is the contraction and 
relaxation necessary for the maintenance of 
posture and locomotion. As a direct conse¬ 
quence, muscle consumes energy in the form 


of ATP, which is continuously generated in 
muscle fibres through glycolytic and oxidative 
pathways. Differences in the percentage of 
specific fibre types exist both between and 
within muscles of the same animal. These fibre 
types differ in contractile rate, amount of force 
generated and efficiency in using either glyco¬ 
lytic or oxidative metabolism pathways. Ratios 
of different fibre types vary in diverse muscles, 
and are directed by developmental stage, mus¬ 
cle activity and muscle function. Fibre type 
ratios are genetically defined, and substantial 
differences in fibre type exist between wild and 
domestic pigs (Ruusunen and Puolanne, 2004), 
providing evidence that domestication and 
selection for meat production have resulted in 
a phenotype consistent with more glycolytic 
fibres and rapid growth. 

The conversion of muscle to meat is a 
complex cascade of metabolic events that dic¬ 
tate the ultimate quality of meat product by 
altering the solubility and functionality of native 
proteins. The following events direct the extent 
of the changes during the conversion of muscle 
to meat: 

• a gradual depletion of available energy in 
the form of glycogen, glucose, glucose-6- 
phosphate and ATP, which leads to rigor 
mortis; 

• a decline in muscle pH from near neutrality 
to 5.4-5.8 (under normal circumstances); 

• a rise in ionic strength, in part, because of the 
inability of ATP-dependent calcium, sodium 
and potassium pumps to function; 

• an increasing inability of the muscle cell to 
maintain an environment that protects 
against lipid and protein oxidation; 

• a degradation of myofibrillar, cytoskele- 
tal and intermediate filament proteins. 

It has long been known that the metabolic 
state of muscle at the time of exsanguination is 
critical in controlling the processes of pH 
decline and rigor development in post-mortem 
tissue. Therefore, factors (genetic or environ¬ 
mental) that define muscle metabolism can 
affect meat quality. Variation in carcass, meat 
and quality traits is significantly influenced by 
the genetic differences between the breeds, but 
variation within the breed is also reported - 
this topic is discussed in detail in the section on 
breed variation and combinability. 
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Meat quality is explained by a group of 
fresh meat processing and sensory characteris¬ 
tics that are important for profitability and com¬ 
petitiveness of the swine industry in the food 
market. Processing meat quality traits consist of 
water-holding capacity, colour, firmness, cook¬ 
ing loss and other processing yields, such as the 
Napole yield (an indicator of the technological 
yield of cured-cooked ham processing: the ratio 
of cooked to raw weight after curing and cook¬ 
ing a 100 g meat sample from the semimem¬ 
branosus muscle). Sensory traits are assessed 
by trained panels or by consumer surveys, and 
involve appearance (colour and marbling), eat¬ 
ing quality as represented by traits that evaluate 
the texture of uncooked meat, and the tender¬ 
ness, juiciness and flavour of cooked meat. 

Glycogen metabolism in the skeletal muscle 
directly influences colour reflectance post¬ 
mortem. This correlates with several technologi¬ 
cal traits, such as water-holding capacity, drip 
loss (the amount of moisture or purge lost over a 
period of time), tenderness and cooking loss 
(Sellier, 1998). As a result, pH at lh post¬ 
mortem (pHJ and pH on the day after slaughter 
(ultimate post-mortem pH, or pHJ are two meat 
quality indicators used broadly for predicting the 
technological and eating quality of pork (Sellier, 
1998; Barbut et aI., 2008). Composition and 
intramuscular lipid content can also have some 
influence on sensory quality. In general, as lipid 
content increases, star probe values (a measure 
of tenderness) as well as sensory chewiness 
scores decrease, especially in pork with interme¬ 
diate pH values (Lonergan et a/., 2007). 

The traits most often used to characterize 
carcass composition are dressing percentage or 
the proportion of the hot carcass from the live- 
weight of the animal, carcass length, carcass lean 
percentage, loin muscle area and backfat thick¬ 
ness. Backfat depth is the preferred measure of 
carcass composition because: (i) it can be accur¬ 
ately measured on live pigs using ultrasound 
scanning instruments; and (ii) it represents a good 
predictive value of carcass fat percentage. 

Heritability of Meat Quality Traits and 

Carcass Composition 

The first heritability ( h 2 ) estimates for meat 
quality were reported in the early 1960s 


(Duniec et a/., 1961; Jonsson, 1963; Ollivier 
and Mesle, 1963). Since then, h 2 estimates 
have been reported for many meat quality 
traits, with colour and pH u being the most 
studied. A comprehensive review of these 
reports was presented by Sellier (1998), and 
the most recent reported estimates are within 
the previously reported ranges. A summary 
of the average and range of h 2 values reported 
by Sellier (1998) is presented in Table 15.1 

for most of the meat and fat quality traits. 
Meat quality traits have in general low-to- 
moderate heritability, as the average values 
of many studies fall into the range 0.10- 
0.30. In the group of sensory traits, the h 2 of 
tenderness measured by subjective and objec¬ 
tive determinations (h 2 = 0.25-0.30) is more 
heritable than flavour and juiciness ( h 2 < 
0.10). In the group of technological traits, 
the average heritability of colour was one of 
the highest (0.28, range 0.15-0.57), com¬ 
pared with cooking and drip loss (0.16) or 
pH u (0.21). The high heritability estimates 
for Napole yield and muscle glycolytic poten¬ 
tial involved studies of lines that were inform¬ 
ative for the Rendement Napole (RN) locus 
(one cause of the RSE condition in pork). 
The substantial effect of the RN allele is 
responsible for the high heritability estimate 
for Napole yield and glycolytic potential in a 
study by Le Roy et at. (1994a). In another 
study of glycolytic potential, in a Large White 
line where it is highly likely that the RN 
allele is absent, the value of heritability 
was considerably lower (h 2 = 0.25; Larzul 
et a/., 1995). 

Carcass composition traits are moderately- 
to-highly heritable. The h 2 values of dressing 
percentage vary from 0.30 to 0.35, and for 
carcass length they vary from 0.55 to 0.60 
(Table 15.2). Backfat thickness was the most 
studied trait from this class of traits and the 
average of the h 2 values is very similar to that 
estimated for firmness (0.43). Similar average 
h 2 values were estimated for loin muscle area 
(0.47) and lean percentage (0.48). These high 
heritabilities would be expected because 
breeders have known for over a century that 
backfat depth is relatively easy to change by 
selection. 

The water, stearic and linoleic acid con¬ 
tents in subcutaneous fat have h 2 estimates that 
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Table 15.1. Range of heritability for meat and fat quality traits (updated from Sellier, 1998). 


Trait 

Number and range of 
heritability estimates 

General quality traits 

pH, 3 

14 

0.04-0.41 

pH u b 

33 

0.07-0.39 

Colour (light reflectance, C/EL* value) 

29 

0.15-0.57 

Water-holding capacity 

15 

0.01-0.43 

Drip loss 

10 

0.01-0.31 

Cooking loss 

9 

0.00-0.51 

Technological yield (cooked ham processing) 

3 

0.09-0.40 

Napole yield 0 

5 

0.26-0.78 

Meat quality index 

13 

0.11-0.33 

Visual score of meat quality 

8 

0.10-0.37 

Eating quality traits 

Tenderness (instrumental determination) 

10 

0.17-0.46 

Tenderness (sensory panel score) 

9 

0.18-0.70 

Flavour (sensory panel score) 

6 

0.01-0.16 

Juiciness (sensory panel score) 

8 

0.00-0.28 

Overall acceptability (sensory panel score) 

2 

0.16-0.34 

Muscle composition traits 

% water 

7 

0.14-0.52 

% lipid 

19 

0.26-0.86 

% protein 

1 

— 

% glycogen (glycolytic potential) 

3 

0.25-0.90 

Fat composition traits (backfat) 

% water 

2 

0.27-0.42 

% lipid 

1 

— 

% stearic acid (C18:0) d 

3 

0.30-0.57 

% linoleic acid (Cl 8:2) d 

3 

0.59-0.67 

Androstenone level (entire males) 

5 

0.25-0.88 

Other traits 

Intensity of boar taint (score) 

1 

— 

Firmness of backfat (score) 

1 

— 


a pH 1t 1 h post-mortem pH. 
b pH u , ultimate post-mortem pH. 

c Napole yield is an indicator of the technological yield of cured-cooked ham processing: the ratio of cooked to raw weight 
after curing and cooking a lOOg meat sample from the semimembranosus muscle. 

Expressed in % of total fatty acids. 


Table 15.2. Average heritability estimates for carcass composition traits (Sellier, 1998). 


Reference 


Trait 

Stewart and Schinckel (1989) 

Ducos (1994) 

Ultrasonic backfat thickness 

0.41 

0.45 

Fat depth over the tenth rib 

0.52 

— 

Loin muscle area 

0.47 

0.48 

Lean percentage 

0.48 

0.54 

Dressing percentage 

0.30 

0.36 

Carcass length 

0.56 

0.57 
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range from 0.35 to 0.64, which are quite a 
lot higher than the rest of the meat quality 
traits and more in line with the h 2 for carcass 
fatness (Sellier, 1998). Similarly, the average 
heritability for fat content in muscle was con¬ 
siderably higher (0.50) than the content of 
water and protein percentage (0.22 and 0.20). 
These reports are not surprising since a lipid 
profile is generally affected by carcass fatness 
(Sellier, 1998). 

Heritability of boar taint score, an unpleas¬ 
ant odour and flavour of the meat from intact 
male pigs, is moderate to high (0.54). High 
levels of androstenone and skatole in adipose 
tissue are considered the primary sources of 
boar taint. Androstenone is a steroid synthe¬ 
sized in the testicles that causes a pronounced 
urine-like odour and flavour in meat. The level 
of androstenone is highly heritable, with an 
average of 0.56 and ranging from 0.25 to 
0.88 (Sellier, 1998; Sellier et a/., 2000; 
Tambyrajah et a/., 2004). An additional source 
of odour is indoles, which are produced by 
bacteria in the colon during the breakdown of 
the amino acid tryptophan (Jensen et ai, 
1995); these give meat a faecal-like odour and 
flavour (Weiler et a/., 2000). Skatole is an 
indole and its level is characterized by a mod¬ 
erate heritability, ranging from 0.19 to 0.54 
(Pedersen, 1998; Tajet et a/., 2006). It is not 
yet fully understood why the high levels of ska¬ 
tole in fat tissue exist in boars but not in cas¬ 
trates and gilts. 

The moderate-to-high heritability of car¬ 
cass traits and the moderate heritability of 
some of the meat quality traits make them 
relatively ideal for genetic improvement using 
traditional selection methods. However, meas¬ 
urement of some of these traits is relatively 
expensive and needs to be performed post¬ 
mortem. Selection of purebred individuals 
based on the performance of crossbred com¬ 
mercial products is another option, but this 
will significantly increase the generation inter¬ 
val. Therefore, this group of traits is one for 
which the application of genomic tools is espe¬ 
cially helpful. Training data of the marker 
effects generated on commercial terminal 
lines, and updated regularly, can be integrated 
into the breeding values and used in the suc¬ 
cessive generations. 


Genetic Relationships Between 
Carcass Composition, Meat and 

Quality Traits 

Genetic correlations between carcass 

composition traits 

Genetic improvement of carcass quality has still 
been achieved for some of the carcass traits 
despite the disadvantages associated with phe- 
notyping. For example, in the last 40 years, a 
strong emphasis on selection for increased lean¬ 
ness has utilized the strong negative genetic cor¬ 
relation between ultrasound measurement of 
backfat thickness on live animals and carcass 
lean content. The magnitude of this genetic cor¬ 
relation ranged from -0.70 to-0.90, and is not 
influenced by differences in carcass leanness 
observed in different breeds (Ducos et a/., 1993; 
Bidanel and Ducos, 1995, 1996; Labroue et a/., 
1996). Average values of genetic correlations 
(r A ) among major body and carcass composition 
traits have been summarized by Sellier (1998) 
based on the reviews of Stewart and Schinckel 
(1989) and Ducos (1994), and are presented in 
Table 15.3. Carcass length is shown here to be 
negatively correlated with loin muscle area 
(-0.18) and tenth rib backfat (-0.21) but posi¬ 
tively correlated with total lean content (0.18). 


Genetic correlations between meat 

quality traits 

Estimates of genetic correlations between meat 
quality traits are affected by the segregation of 
alleles from major loci that significantly affect the 
variation of these traits in certain populations. 
For example a near ‘-1’ (Le Roy et a/., 1994b) 
genetic correlation was estimated between mus¬ 
cle glycolytic potential and Napole technological 
yield of cured-cooked ham in a commercial line 
segregating for the RN~/rn + locus, while the 
same correlation estimated in a population pre¬ 
sumably fixed for the rn + allele was considerably 
lower (r A = -0.50; C. Larzul, personal communi¬ 
cation, 1996, cited by Sellier, 1998). A review of 
the average values of the genetic correlations 
between the main indicators of pork quality, such 
as pH l5 pH u , muscle glycolytic potential and 



360 


D.C. Ciobanu et al. 


Table 15.3. Average values of genetic correlations among carcass composition traits (Sellier, 1998). 


Reference 



Traits 

Stewart and 
Schinckel (1989) 

Ducos (1994) 

Lean percentage 

Ultrasonic backfat thickness 

— 

-0.65 


Tenth rib fat depth 

-0.87 

— 


Loin muscle area 

0.65 

— 


Killing-out percentage 

-0.10 

0.20 


Carcass length 

0.18 

— 

Tenth rib fat depth 

Loin muscle area 

-0.38 


Ultrasonic backfat 

Dressing percentage 

Carcass length 

Dressing percentage 

0.19 

-0.21 

0.18 

thickness 

Loin muscle area 

Dressing percentage 

0.50 

_ 


Carcass length 

-0.18 

— 

Dressing percentage 

Carcass length 

-0.32 

— 


intramuscular fat content, and the main process¬ 
ing and sensory traits is presented in Table 15.4, 
based on Sellier (1998). These studies show that 
pH u is a good predictor of cooking loss (r A range 
-0.82 to -0.45), drip loss (-0.99 to -0.50), ten¬ 
derness (0.40 to 0.68) and overall acceptability 
(r A = 0.59). In addition to high pH u , another 
good indicator of low drip loss is a smaller myo- 
fibre diameter (r A = 0.73) (Dietl et a/., 1993). 
A good predictor of overall acceptability is repre¬ 
sented by intramuscular fat (r A = 0.61). 

Interestingly, intramuscular fat is a less accurate 
genetic predictor of objective tenderness (r A = 
0.15) than pH u (r A = 0.49) (Sellier, 1998). 

Genetic correlations between carcass 
composition traits with meat and fat 

quality traits 

A moderate genetic antagonism was reported 
between the main indicators of meat quality, 
such as pH u , reflectance and water-holding 
capacity, and carcass lean-to-fat ratio (-0.21 to 
0.26) (Table 15.5). A higher pH, lower colour 
reflectance and higher water-holding capacity 
are associated with less lean carcasses. A similar 
trend was observed for cooking loss: an increase 
in cooking loss seems to be associated with 
higher fat-to-lean ratio (Sellier, 1998) 
(Table 15.5). Relatively stronger correlations 
were observed between eating quality traits and 
carcass composition: all eating quality traits (ten¬ 
derness, juiciness, flavour and overall acceptabil¬ 


ity) are negatively correlated with carcass leanness 
(-0.18 to -0.48) and positively correlated with 
carcass fatness (0.24 to 0.34) (Sellier, 1998). As 
expected, the carcass leanness and fatness are 
genetically correlated with intramuscular fat con¬ 
tent, but the estimated values are lower than 
expected (-0.34 and 0.30, respectively). 

In contrast, compositional traits of the sub¬ 
cutaneous fat are moderately to strongly corre¬ 
lated with the lean-to-fat ratio of the carcass. A 
higher lean-to-fat ratio is genetically associated 
with a reduced lipid or stearic acid composition 
and higher linoleic acid content and moisture. 
For example, saturated fatty acid content (Cl8:2) 
is positively correlated with carcass leanness 
(r A = 0.60) and negatively with carcass fatness 
(r A = -0.70) (Table 15.6). Linoleic acid is derived 
from the diet. As pig fatness increases, there is a 
greater proportion of synthesized lipids, so the 
adipose fatty acid profile is directly linked to car¬ 
cass fatness (Fiedrafita eta/., 2001). 


Major Functional Alleles that Affect 
Carcass Composition and Meat 

Quality Traits 

In the last decade, several major functional poly¬ 
morphisms affecting a number of carcass meat 
and quality traits measured on slaughtered ani¬ 
mals, such as post-mortem pH, colour and ten¬ 
derness, were identified and are currently used 
in selection (Table 15.7). Genetic tests for most 
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Table 15.4. Average values of genetic correlations among meat quality traits (Sellier, 1998). 


Genetic correlation 


Traits 3 Mean Range of estimates 


Drip loss 

pH, 

-0.27 

-0.55 to 0.01 


PH U 

-0.71 

-0.99 to -0.50 


Reflectance 

0.49 

0.49 to 0.49 


WHC 

-0.94 

-0.99 to -0.90 


IMF 

-0.08 

-0.23 to 0.05 


Myofibre diameter 

0.73 

— 

WHC 

pH, 

-0.65 

— 


PH U 

0.45 

0.26 to 0.92 


Reflectance 

-0.39 

-0.66 to -0.18 


IMF 

0.12 

0.02 to 0.22 

Cooking loss 

pH, 

-0.14 

-0.23 to 0.04 


PH U 

-0.68 

-0.82 to -0.45 


Reflectance 

0.26 

0 to 0.47 


WHC 

-0.25 

-0.30 to -0.21 


IMF 

0.07 

-0.03 to 0.23 


Drip loss 

0.66 

0.45 to 0.80 

Napole yield or technological yield 

pH u 

0.70 

0.26 to 0.99 


GP in vivo 

-0.83 

-1 to -0.5 

Reflectance 

pH, 

-0.38 

— 


PH U 

-0.53 

-0.66 to -0.38 


IMF 

0.01 

-0.12 to 0.15 

Tenderness 

pH, 

0.27 

— 


PH U 

0.49 

0.40 to 0.68 


Reflectance 

-0.16 

-0.22 to -0.08 


WHC 

0.23 

0.08 to 0.41 


IMF 

0.15 

-0.08 to 0.53 


Drip loss 

-0.16 

-0.19 to -0.14 


Cooking loss 

-0.46 

-0.57 to -0.401 

Overall acceptability 

pH u 

0.59 

— 


Reflectance 

-0.02 

— 


WHC 

0.46 

— 


IMF 

0.61 

0.54 to 0.68 


a WHC, water-holding capacity; pH„ 1 h post-mortem pH; pH u , ultimate post-mortem pH; IMF, intramuscular fat content; 
GP, muscle glycolytic potential; Napole yield, an indicator of the technological yield of cured-cooked ham processing; 
tenderness, instrumental determination or sensory panel score; overall acceptability, sensory panel score. 


of these polymorphisms are commercially avail¬ 
able to the swine breeding industry for genetic 
improvement of meat quality. 

Ryanodine receptor 1 

The introduction of molecular genetics in pig 
genetic improvement was initiated by the dis¬ 
covery of a recessive allele in the ryanodine 
receptor 1 (RYR1) gene that is responsible for 
porcine stress syndrome (PSS), or malignant 


hyperthermia syndrome (MH) (Fujii et a/., 
1991). Pigs homozygous for this allele are sus¬ 
ceptible to PSS, and are likely to develop the 
pale, soft, exudative (PSE) meat condition 
(Briskey, 1964). 

Christian (1972) hypothesized that PSS is 
a result of an autosomal variation that is reces- 
sively inherited. Eikelenboom and Minkema 
(1974) revealed that the anaesthetic halothane 
can trigger PSS or MHS in genetically 
susceptible pigs. Several reports of halothane 
challenge tests confirmed the recessive 
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Table 15.5. Average values of genetic correlations of meat and eating quality traits with carcass 
composition traits (Sellier, 1998). 


Genetic correlation with 

Trait 3 

Carcass leanness 

Carcass fatness 

Mean 

Range 

Mean 

Range 

PH 1 

0.10 

— 

0.26 

— 

PH U 

-0.13 

-0.50 to 0.08 

0.15 

-0.05 to 0.45 

Reflectance 

0.16 

-0.16 to 0.42 

-0.21 

-0.48 to 0.07 

Water-holding capacity 

-0.19 

-0.57 to 0.24 

0.02 

-0.25 to 0.24 

Drip loss 

0.05 

-0.10 to 0.13 

-0.10 

-0.20 to -0.01 

Cooking loss 

-0.07 

-0.16 to -0.06 

0.12 

-0.04 to 0.39 

Napole yield 

— 

— 

0.15 

-0.12 to 0.41 

Muscle glycolytic potential 

0.40 

— 

-0.21 

-0.34 to -0.10 

Meat quality index (IQV) 

-0.23 

-0.44 to 0.06 

0.18 

0.01 to 0.39 

Intramuscular fat content 

-0.34 

-0.55 to -0.07 

0.30 

0.04 to 0.60 

Tenderness 

-0.20 

-0.48 to 0.12 

0.24 

0.11 to 0.48 

Juiciness 

-0.18 

-0.47 to 0.08 

0.29 

-0.19 to 0.85 

Pork flavour 

-0.27 

-0.60 to 0.02 

0.35 

-0.03 to 0.72 

Overall acceptability 

-0.48 

-0.71 to -0.32 

0.34 

-0.04 to 0.70 


a pH 1f 1 h post-mortem pH; pH u , ultimate post-mortem pH; Napole yield, an indicator of the technological yield of 
cured-cooked ham processing. 


Table 15.6. Average values of genetic correlations 
of qualitative characteristics of subcutaneous backfat 
with carcass composition traits (Sellier, 1998). a 



Genetic correlation 

Backfat 

Carcass 

Carcass 

characteristic 

leanness 

fatness 

% water 

0.35 

-0.65 

% lipid 

-0.75 

0.70 

% Cl 8:0 

-0.40 

0.40 

% Cl 8:2 

0.60 

-0.70 

Firmness score 

-0.40 

0.70 


a Data from Schworer et al. (1988), Bout et al. (1989) and 
Cameron (1990). 


inheritance of the newly called halothane 
(HAL) locus and the PSS/MHS phenotype 
(Ollivier et a/., 1975; Smith and Bampton, 
1977). This early tool enabled breeders to dis¬ 
tinguish individuals with a normal phenotype 
(genotype NN and Nn) from individuals sus¬ 
ceptible to PSS (nn). Linkage analyses of the 
halothane sensitivity mapped HAL close to the 
H red blood cell antigens, phosphohexose iso- 
merase ( PHI) and 6-phosphogluconate dehy¬ 
drogenase ( 6-PGD) (Rasmusen et a/., 1980). 


Table 15.7. Functional polymorphisms that affect 
meat and/or carcass traits in the pig. 


Genes 

Trait 

Reference(s) 

RYR1 a 

Malignant 

Fujii et al., 


hyperthermia 
syndrome (MHS), 
pale soft and 
exudative (PSE) 
meat 

1991 

PRKAG3 b 

Glycogen 

Milan et al., 


metabolism, 

2000; 


Napole yield, pH 

Ciobanu 


and colour 

et al., 2001 

IGF2 C 

Carcass 

Van Lae re 


composition, 
backfat thickness 

et al., 2003 

MC4R d 

Carcass 
composition, 
backfat thickness 

Kim et al., 2000 


a ryanodine receptor 1 gene; b protein kinase AMP- 
activated gamma 3-subunit gene; c insulin-like growth 
factor 2 gene; d melanocortin 4 receptor gene. 


In situ hybridization of the linked glucosephos- 
phate isomerase (GPI) gene assigned the HAL 
locus on SSC6 (Sus scrofa chromosome 6) 
(Davies et al., 1988). 
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In skeletal muscle, contraction and metab¬ 
olism are regulated by the concentration of 
intracellular Ca 2+ (O’Brien, 1987; Britt, 1991), 
and RYR1 was considered a good positional 
candidate gene for the observed effects because 
it modulates the release of Ca 2+ by the endo¬ 
plasmic reticulum (MacLennan and Phillips, 
1992). Fujii et al. (1991) uncovered a single 
missense substitution in RYR1 (pArg615Cys, 
commercially known as HAL-1843) responsi¬ 
ble for the variation of the PSS phenotype. The 
n allele was present at relatively high frequency 
in several populations as a result of selection for 
increased lean meat content. The HAL locus is 
expressed in an approximately additive manner 
for the carcass lean percentage, loin muscle 
area, dressing percentage, pHi and meat colour 
score (light reflectance, the CIE L* value) 
(reviewed by Sellier, 1998). The discovery of 
this mutation enabled breeders to exploit the 
intermediate advantages in leanness and other 
meat quality traits of the heterozygotes (Nn) 
while controlling the negative effects of PSS in 
nn homozygotes (Sellier, 1998). 

Otto et al. (2007) investigated the relation¬ 
ship between 12 DNA markers, including RYR1- 
pArg615Cys, and several meat quality traits 
measured on 1155 market pigs. As expected, 
the pArg615Cys substitution had the largest 
effect on meat quality. Heterozygous individuals 
were significantly inferior for all meat quality 
traits with the exception of pH, initial conductiv¬ 
ity and meat redness. Drip loss measured from 1 
to 7 days post-mortem was 43% higher in 
heterozygotes than in animals of the stress- 
resistant genotype. Commercial testing of this 
mutation is available and the reduction of the 
deleterious allele is being practised worldwide. 

Protein kinase AMP-activated 
gamma 3-subunit 

Early studies of the variation in the processing 
yield of ham uncovered a locus that segregates in 
Hampshire populations (Monin and Sellier 
1985; LeRoy et al., 1990). The locus, called 
Rendement Napole (RN), is characterized by the 
presence of the dominant RN~ allele that is 
responsible for an increase of approximately 
70% in muscle glycogen content, primarily in 
the skeletal muscle, compared with the recessive 


rn + allele (Monin et a/., 1992; Estrade et a/., 
1993). The distribution of glycogen content 
is bimodal, with almost no overlap between RN~ 
homozygotes and carriers. Individuals that carry 
at least one copy of the RN~ allele produce meat 
with a lower ultimate pH as a result of post¬ 
mortem degradation of the increased glycogen 
content. The RN~ allele is almost completely 
specific to the Hampshire breed and its fre¬ 
quency was increased as a result of selection for 
lean meat content (Enfalt et al., 1997; 
Andersson, 2003); although the allele may occur 
in other breeds, its frequency is negligible. 

The RN locus was mapped on SSC15 
(Looft et al., 1996; Mariani et a/., 1996; Milan 
et al., 1996), and a mutation in codon 200 
( p.Arg200Gln ) of the protein kinase AMP- 
activated gamma 3-subunit gene (PRKAG3) 
was found to be responsible for the differences 
in muscle glycogen content (Milan et al., 
2000). PRKAG3 is one of the regulatory sub¬ 
units of AMP-activated protein kinase (AMPK), 
which plays a key role in regulating energy 
homeostasis in eukaryotes (Hardie et a/., 
1998). Environmental or nutritional stress fac¬ 
tors could affect the AMP/ATP ratio, which 
triggers the ‘AMPK cascade,’ which, in turn, 
initiates measures to conserve energy (Thornton 
et a/., 1998) and activate ATP synthetic path¬ 
ways (Hardie et a/., 1998). This mutation is 
located in an extremely well-conserved region 
that directly binds AMP and initiates the pro¬ 
cess of AMPK activation (Barnes et al., 2004). 

Using a three-generation intercross 
between Berkshire and Yorkshire (B x Y) pigs, 
a QTL (quantitative trait locus) affecting muscle 
glycogen content and related meat quality traits 
(colour and pHJ was also mapped to SSC15 
(Malek eta/., 2001b). The PRKAG3 gene was 
considered to be a good candidate for the 
observed effects. The RN~ mutation was not 
present in the cross, but three missense substi¬ 
tutions were identified (p.Thr30Asn , p. 
Glg52Ser, p.Ilel99Val). To confirm the 
hypothesis that additional alleles in PRKAG3 
were associated with differences in muscle gly¬ 
cogen content, 1800 animals from five com¬ 
mercial lines were analysed for associations 
between candidate polymorphisms and ham 
and loin pH u and Minolta L and b colour scores. 
The results showed that the p.Ilel99Val poly¬ 
morphism is associated with the largest effects. 
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In a different study, Otto et al. (2007) showed 
that the homozygote genotype for the 
PRKAG3-Ilel99 allele was associated with less 
drip loss, higher pH u and darker meat than the 
rest of the genotypes. p.Ilel 99Val is a conserv¬ 
ative substitution, and is located next to codon 
200 where the RN~/rn + (p.Arg200GIn ) poly¬ 
morphism is present in Hampshire populations. 
The effect of the PRKAG3 pl99IIe_200Arg 
haplotype is opposite to the effect of 
199Val_200Gln as it reduces muscle glycogen 
content and increases pH post-mortem. The 
favourable PRKAG3 Uel99 allele is prevalent 
in all common commercial breeds and the 
potential implication for improving pork quality 
is significant. The discovery that the mutation 
in PRKAG3 other than the classic RN geno¬ 
type influences meat quality demonstrates that 
different changes (alleles) in key metabolic pro¬ 
teins have the potential to affect meat quality. 

Calpastatin 

Calpastatin (CAST) is a specific inhibitor of 
p- and m-calpain proteases and for many years 
was considered a good functional candidate 
gene that could affect tenderness. Calpain pro- 
teinases (specifically p-calpain) are known to 
catalyse protein degradation in post-mortem 
muscle and improve meat tenderness (Huff- 
Lonergan et al., 1996). Inhibition of calpains by 
calpastatin can affect the rate and extent of ten- 
derization of meat (Koohmaraie et al., 1991; 
Sensky et al., 1998; Parr et al., 1999). The 
first genetic evidence showing that the CAST 
gene is not only a potential functional candidate 
but also harbours genetic variants that influence 
tenderness was reported by Ciobanu et al. 
(2004). Using the B x Y reference population, 
Malek et al. (2001b) mapped a suggestive QTL 
to SSC2 that affected raw firmness scores, aver¬ 
age Instron force, tenderness, juiciness and 
chewiness of cooked meat. CAST was consid¬ 
ered to be a good functional and positional can¬ 
didate for the B x Y QTL, and cDNA 
(complementary DNA) sequencing of CAST in 
B x Y and commercial lines uncovered 12 mis- 
sense and silent mutations (Ciobanu et al., 
2004). Haplotypes covering most of the coding 
region were constructed, and two SNPs (single 
nucleotide polymorphisms) (p.Arg249Lys and 


WA 


p.Ser638Arg) that specified common haplo¬ 
types were used for association analyses with 
meat quality traits. Results demonstrated that 
one CAST haplotype (249Lys-638Arg) was 
significantly associated with lower Instron force 
(indicating greater tenderness) and higher juici¬ 
ness sensory scores in B x Y, and lower cooking 
loss and higher juiciness in a commercial 
crossbred population (Ciobanu et al., 2004). In a 
different study, CAST p.Ser638Arg genotypes 
were associated with more efficient moisture loss 
in dry-cured hams (Stalder et al., 2005). 

The presence of the SSC2 QTL responsi¬ 
ble for variation in tenderness measures was 
subsequently identified in other populations 
(Stearns et al., 2005a; Rohrer et al., 2006; 
Meyers et al., 2007). Meyers and Beever (2008) 
fine mapped the QTL previously discovered 
(Stearns et al., 2005a; Meyers et al., 2007) 
and provided additional evidence that CAST is 
the most likely gene affecting the tenderness 
measurements. Lindholm-Perry et al. (2009) 
recently assessed the associations between sliced 
shear force (SSF) and the genotypes of 40 mark¬ 
ers, including nine CAST polymorphisms that 
cover the SSC2 QTL region from 64.8 to 
93 Mb. Only three of the CAST polymorphisms 
were significantly associated with SSF, including 
a novel intronic SNP responsible for the largest 
effect, and two missense SNPs from Ciobanu 
et al. (2004) {p.Ser66Asn and p.Arg249Lys). 
The same intronic mutation was significantly 
associated with variation in CAST gene expres¬ 
sion in the longissimus lumborum muscle. 

In the most recent study, Nonneman et al. 
(2010) detected 194 SNPs in the promoter 
region of calpastatin. Twenty-nine of the SNPs 
were located in transcription factor binding 
sites, and gel shift assays for five of the poly¬ 
morphic sites demonstrated a shift when 
probes were incubated with nuclear extracts 
from muscle, heart or testes. Four of these 
polymorphisms showed allele specificity in 
binding nuclear proteins that could alter expres¬ 
sion of CAST and affect tenderness by regulat¬ 
ing calpain protease activity. 

Insulin-like growth factor 2 

A paternally expressed QTL influencing muscle 
growth, fat deposition and heart size was 
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identified in crosses between European wild 
boar and Large White (WB x LW) and between 
Pietrain and Large White (P x LW) (Jeon et a/., 
1999; Nezer et a!., 1999). This QTL explained 
15-30% of the phenotypic variation in lean 
mass and 10-20% of the variation in backfat 
thickness. The QTL position coincides with the 
position of the imprinted insulin-like growth 
factor 2 gene ( IGF2 ). A substitution located in a 
conserved CpG island in intron 2 of IGF2 was 
found responsible for the phenotypic differ¬ 
ences (Van Laere et a/., 2003). The pigs inher¬ 
iting the mutant allele from their sire have a 
threefold increase in IGF2 expression in post¬ 
natal muscle and an increase of 3-4% in muscle 
mass. It was estimated that approximately 50% 
of the differences in lean mass between Pietrain 
and Large White pigs are caused by RYR1 and 
IGF2 polymorphisms (Nezer et a/., 1999). 


Other genes and DNA sequence variants 

The large number of defined markers for meat 
quality phenotypes is not surprising because 
genetic changes that affect muscle metabolism 
and cellular function have the potential to influ¬ 
ence the key events during the conversion of 
muscle to meat and/or the protein and lipid 
profile of meat. In the last decade, variants of 
many candidate genes were analysed for their 
effects on several carcass and meat quality 
traits. Different alleles in the melanocortin 4 
receptor gene ( MC4R ) were shown to be asso¬ 
ciated with carcass composition. A missense 
polymorphism in MC4R (p.Asp298Asn) was 
associated with differences in feed intake and 
average backfat thickness in commercial popu¬ 
lations (Kim et a/., 2000). In a new study, Fan 
et al. (2009) discovered additional SNPs in the 
MC4R of the B x Y resource and found that 
the new p.Arg236His SNP influenced backfat 
and growth, while the p.Asp298Asn was pri¬ 
marily associated with variation in growth rate. 
An interaction between the effects of the two 
SNPs found for average daily gain could explain 
some of the previous discrepancies reported 
for MC4R in different pig populations (Park 
et a/., 2002; Stachowiak et a/., 2006). SNPs 
in genes from the cathepsin family (CTSB, 
CTSD , CTSF , CTSH , CTSK , CTSZ and 


CTSL) known to influence protein degradation 
were associated with meat production and car¬ 
cass traits (Russo et a/., 2002, 2008; Fontanesi 
et a/., 2010a,b). 

The microsomal triglyceride transfer pro¬ 
tein (MTTP) gene plays an important role in the 
assembly of nascent lipoproteins, and was ana¬ 
lysed as a positional candidate gene for a QTL 
on SSC8 affecting the palmitic, palmitoleic and 
oleic fatty acids in an Iberian x Landrace (I x 
LR) F 2 cross (Clop et a/., 2003). A conserved 
non-synonymous mutation (p.Phe840Leu ) in 
the MTTP gene was found to be significantly 
associated with MTTP lipid transfer activity 
and fatty acid profile in porcine fat (Estelle 
et a/., 2009). 

Otto et al. (2007) analysed the associa¬ 
tions between 12 DNA markers, including 
RYRl-pArg615Cys and PRKAG3-I199V , and 
meat quality traits measured on 1155 market 
pigs. While the RYRl-pArg615Cys and 
PRKAG3-p.Ilel 99Val polymorphisms affected 
many meat quality traits, some of the other 
markers were also found to influence the varia¬ 
tion of some of the traits. SNPs in MC4R 
( p.Asp298Asn ) and high-mobility group AT 
hook 1 (HMGA1) significantly affected drip 
loss, while SNPs in the lactate dehydrogenase 
A gene (LDHA), CAST (p.Arg249Lys) and 
ATPase Ca 2+ transporting fast twitch (ATP2A1) 
influenced pH. A SNP located in ATP2A1 was 
associated with differences in intramuscular fat 
content in the longissimus dorsi muscle, while 
a SNP in glucose transporter 4 (GLUT4) 
affected temperature 45 min post-mortem. 
SNPs in MC4R ( p.Asp298Asn ), LDHA, 
GLUT4, HMGA1 and CAST [p.Arg249Lys 
and p.Ser638Arg) influenced meat colour. 


QTL Mapping 

Since 1994 a large number of QTL mapping 
studies have been initiated and many QTLs 
responsible for variation in growth, carcass 
composition and meat quality have been 
reported. In the first QTL study in pigs, 
Andersson et al. (1994) used the WB x LW F 2 
experimental population to identify QTLs 
responsible for the differences in growth and 
fat deposition. A significant QTL was identified 
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on SSC4 that was responsible for 20% of the 
phenotypic variation for abdominal and back- 
fat. A considerable number of QTL studies fol¬ 
lowed, and most of them were based on 
resource populations produced by crossing 
breeds that were phenotypically divergent. 
Most of the carcass and meat quality traits are 
quantitative and are affected by many loci that 
have a wide degree of effects on phenotypic 
variation. Conflicting effects and implications 
of most of the chromosomes in the variation of 
the complex meat quality traits is not a surprise 
because allelic segregation of these loci can 
vary from one population resource or cross to 
another, and markers were spaced far apart. 
A summary of the populations developed is 
presented in Table 15.8. Some of the studies 
employed a full genome QTL scan, while oth¬ 
ers focus their analysis on a limited number of 
chromosomes where QTLs were previously 
discovered. The traits that were easily meas¬ 
ured, such as growth, are well represented in 
these studies compared with the traits that are 
more difficult to measure, such as carcass com¬ 
position and meat quality traits. Some of these 
studies were difficult to compare as they often 
employed different types of analyses and used 
different sets of markers. 

The Pig QTL database (PigQTLdb; http:// 
www.genome.iastate.edu/cgi-bin/QTLdb/ 
SS/index) has integrated all the QTL mapping 
results and provided the visualization tools to 
compare swine QTL data (Hu and Reecy, 
2007). The reference map used to display and 
to compare QTL positions was based on the 
map reported by Rohrer et al. (1996). 
Currently, there are 5621 pig QTL results in 
the PigQTLdb associated with 546 different 
traits that were reported in 237 publications. 
The richest traits in QTL reports are for drip 
loss (n = 936), average backfat thickness (149) 
and loin muscle area (110). In the following 
subsections we will provide a description of 
several examples of QTLs that have been 
mapped in different swine populations. 

QTLs for fatness traits 

Significant QTLs for average backfat thickness 
were detected in all porcine chromosomes 
with the exception of SSC15, where only 


suggestive QTLs were located. Consistent 
QTL effects were detected on the same region 
of several swine chromosomes in different 
swine populations or generations. For 
example, the QTLs for abdominal and backfat 
revealed by Andersson et al. (1994) on SSC4 
were later confirmed by Marklund et al. (1999) 
in subsequent generations of the same popula¬ 
tion resource, and later refined by Berg et al. 
(2006). A QTL for backfat thickness was 
revealed at the proximal end of SSC1 by 
Malek et al. (2001b) and by Knott et al. 
(1998). MC4R is located in the same region of 
SSC1 and MC4R alleles were found to be asso¬ 
ciated with feed intake and average backfat 
thickness in commercial populations. Park 
et al. (2002) confirmed the causative role of 
the p.Asp298Asn for any fatness-related traits 
in a WB x LW cross. Additional SNPs were 
identified in the B x Y resource and interac¬ 
tions between the effects of two SNPs 
explained some of the previous inconsistency 
reported for MC4R in different studies (Fan 
et al ., 2009). Two studies of a Meishan- 
derived population at the US Meat Animal 
Research Center (USAMRC) identified QTLs 
that influence backfat depth on SSC1 and 7 
(Rohrer and Keele, 1998a; Rohrer, 2000). 
The QTL effect from SSC7 was confirmed by 
a subsequent study in the F 8 and F 10 generations 
of the USAMRC Meishan-based population 
(Kuehn et al., 2007). 

A region that was well characterized and 
affects backfat is located at the end of the short 
arm of SSC2 harbouring IGF2 locus (Knott 
et al., 1998; Jeon et al., 1999; Nezer et al., 
1999; Rattink et al., 2000; de Koning et al. 
2001). A QTL associated with variation in 
backfat thickness was localized on SSCX by 
Rohrer and Keele (1998a), Rohrer (2000), 
Harlizius et al. (2000) and Cepica et al. (2007). 
A potential candidate gene for this QTL is the 
thyroxine-binding globulin (TBG) gene. A poly¬ 
morphism in TBG was associated with an addi¬ 
tive effect of 1.2 to 1.6 mm of backfat (Kuehn 
et al., 2007). QTLs that affect fatness were 
detected on SSC6 in a Meishan x Large White 
(M x LW; Bidanel et al., 2001), Iberian x 
Landrace (I x LR; Ovilo et al., 2000, 2002), 
B x Y (Malek et al., 2001a), Duroc x Pietrain 
(D x P; Edwards et al., 2008a) and commer¬ 
cial crosses (Grindflek et al., 2001; Mohrmann 



Table 15.8. Main QTL mapping projects in pigs for meat quality and carcass composition (updated from Rothschild et al., 2004). 


No. of 


Population 3 

animals 

Markers 

Traits b 


Reference(s) 

Px LW 

1032 

11 (SSC2) 

GR, CC 


Nezer et al., 1999; Van Laere et al. , 2003 

M x LW 

1103 

137 

GR, CC, 

MQ 

Bidanel et al., 2001; Milan et al., 2002; Sanchez et al., 2006 

MxGMP 

215 

318 

GR, CC 


Wada et al., 2000 

M x LW 

800 

127-132 

CC, MQ 


de Koning et al., 1999, 2000, 2001; Harlizius et al., 2000; 






Rattink et al., 2000 

(D x LR) x LW 

305 

11 (SSC4); 9 (SSC 6); 9 (SSC7) 

MQ 


Grindflek et al., 2001 

M x LW 

390 

9 (SSC4) 

CC 


Walling etal., 1998, 2000 

1 x LR 

500 

7 (SSC6) 

CC, MQ 


Ovilo etal., 2000; Perez-Enciso et al., 2000 


369 

92 (All autosomes) 

MQ, CC 


Ovilo etal., 2002; Varona etal., 2002 


139 

139 

MQ 


Mercade et al., 2005, 2006; Estelle et al., 2009 

WBxLW 

191 

117-236 

CC, MQ 


Andersson etal., 1994; Andersson-Eklund etal., 1998; 






Knott etal., 1998; Jeon etal., 1999 

LW, LR, H, P, M, CoL, 

4400 

25 (SSC1,2, 3, 4, 6, 7, 8, 9, 10, 13) 

GR, CC, 

MQ 

Evans et al., 2003 

LW, D x LW, Y x LW, LW, LR 

2713, 

GR, CC 



Nagamine etal., 2003, 2004, 2009 


1505 





CH xY 

294 

5 (SSC4); 10 (SSC7) 

CC, MQ 


Rothschild etal., 1995; Wang etal., 1998 

BxY 

500 

125 

CC, MQ 


Ciobanu etal., 2001,2004; Malek etal., 2001a;Thomsen 






et al., 2004; Kim et al., 2005; Ramos etal., 2006, 2009a 

M x SL 

540 

157 

CC, GE 


Rohrerand Keele, 1998a,b; Rohrer, 2000 

M xY 

298 

119 (all autosomes) 

CC, MQ 


Paszek et al., 1999 


116 

119 (SSC1,6, 7, 8, 12) 



Paszek et al., 2001 

M x D 

865 

180 

GR, CC 


Sato et al., 2003 

LR 

546 

24 (SSC1.2, 3, 4, 6, 7, 8, 9, 10, 13) 

CC, MQ 


Vidal etal., 2005 

WB x LW 

353 

225 

MQ 


Nii etal., 2006 

Bx D 

832 

30 (SSC2, 6, 13, 18) 

GR, CC, 

MQ 

Stearns et al., 2005a,b; Meyers et al., 2007; Meyers and 






Beever, 2008 

D x LR 

370 

182 

CC, MQ 


Rohrer et al., 2006 

D x BMP 

308 

88 

CC, MQ 


Wimmers et al., 2006 

(P x LW) x AN 

715 

73 (SSC1,2, 4, 5, 6, 8, 10, 11, 12, 13, 14, 

CC, MQ 


van Wijk et al., 2006 



15, 17, 18) 





1855 

26 (SSC2) 

MQ 


Heuven et al., 2009 

CoL 4x cross 

1187 

198 

GR, CC, 

MQ 

Harmegnies et al., 2006 


Continued 
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Table 15.8. Continued. 


Population 3 

No. of 
animals 

Markers 

Traits b 

Reference(s) 

DxP 

585 

106 

GR, CC, MQ 

Liu et al., 2007, 2008 


74 


GE 

Ponsuksili et al., 2008 

D x LW 

775 

91 

MQ 

Sanchez et al., 2007 

DxP 

510 

124 

CC, MQ 

Edwards et al., 2008a,b 

H x LR 

450 

120 

CC, MQ 

Markljung et al., 2008; Karlskov-Mortensen et al., 2006 

D 

1004 

10 (SSC7) 

CC, MQ 

Uemoto etal., 2009 

DxE 

1028 

183 

MQ, CC 

Li etal., 2009; Ma etal., 2009 

M xY 

180 

39 

CC 

Zhang et al., 2007 

Px AN 

315 

51 (SSC2, 4, 8, 9, 10 and 14) 

CC 

Duthie et al., 2008 

M x P 

316 

171 

CC, MQ 

Geldermann etal., 2003 

WB x P 

315 

152 



WB x M 

335 

165 




a AN, Anonymous; B, Berkshire; BMP, Berlin Miniature Pig; CH, Chinese breeds; CoL, commercial lines; D, Duroc; E, Erhualian; GMP, Gottingen Miniature Pig; H, Hampshire, I, Iberian 
breed; LR, Landrace; LW, Large White; M, Meishan; P, Pietrain; SL, Synthetic line; WB, wild boar; Y, Yorkshire. 
b CC, carcass composition; GE, gene expression; GR, growth; MQ, meat quality. 
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et a/., 2006). A non-synonymous substitution 
(pLeu663Phe) in the leptin receptor gene 
(LEPR) was significantly associated with back- 
fat depth in I x M and I x LR and considered 
as a candidate for the QTL (Munoz et ai, 
2009). Two QTLs mapped in the same region 
of SSC5 were associated with first-rib backfat 
in a D x P F 2 cross (Edwards et ai, 2008b) and 
with tenth-rib backfat in a Duroc x Landrace 
(D x LR) F 2 cross (Rohrer et a/., 2006). 
Similarly, two different studies mapped QTLs 
associated with backfat on SSC8 (Rohrer, 
2000; Bidanel et ai, 2001). 

As the majority of the QTLs were revealed 
in crosses that involved non-commercial pigs, 
it would be important to determine whether the 
QTL information could be translated in the 
commercial lines breeding programmes. Several 
QTLs for backfat and growth rate previously 
detected in experimental crosses that usually 
involved non-commercial pigs, such as Meishan 
or wild boar, were analysed for segregation in 
ten different commercial populations (Evans 
et a/., 2003). The study showed segregation of 
QTLs in most of the populations despite the 
strong selection pressure for backfat in all ana¬ 
lysed populations. The most consistent QTL 
effects across populations were located on 
SSC3 and SSC4, and were associated with fat¬ 
ness and growth. In a different study, the segre¬ 
gation of QTLs associated with fatness traits 
and previously located on SSC4 and SSC7 
(Rohrer and Keele, 1998a; Walling et a/., 1998; 
de Koning et a/., 1999; Andersson, 2001; 
Bidanel et a/., 2001) were analysed in five com¬ 
mercial lines (Nagamine et a/., 2003). The most 
consistent evidence of segregation was observed 
for the QTLs located on SSC7. Both studies 
showed that QTLs revealed in experimental 
crosses can be detected in highly selected com¬ 
mercial populations and that QTLs that account 
for variation between populations can also 
explain genetic variation within populations. 


QTLs associated with carcass 
composition traits 

QTLs associated with carcass composition 
traits have been identified in most of the 
swine chromosomes. All chromosomes with 


the exception of SSC15 and SSC16 harboured 
QTLs associated with carcass length. Two sig¬ 
nificant QTLs associated with carcass length 
were mapped on SSC6 and 7 in a D x F cross 
(Edwards et a/., 2008b). The QTL from SSC6 
was associated with a dominance effect, while 
the QTL from SSC7 that influences both car¬ 
cass length and dressing percentage acted in 
an additive manner. The allele derived from 
Duroc was found to increase carcass length 
and decrease dressing percentage. The posi¬ 
tion of the QTL on SSC6 coincides with QTLs 
identified in B x Y (Malek et ai, 2001a), 
Meishan x Duroc (M x D; Sato et a/., 2003) 
and Berkshire x Duroc (BxD; Stearns et ai, 
2005a) crosses. A dominance effect similar to 
that reported by Edwards et ai (2008b) on 
SSC6 was identified by Malek et al. (2001a). A 
QTL located in a similar position on SSC7 was 
reported by Rohrer and Keele (1998a) and 
explained more than 15% of carcass length in 
a M x LW cross, with the Meishan allele 
increasing the trait. Malek et al. (2001a) also 
revealed suggestive QTLs on SSC6, 11 and X, 
with the Berkshire (B) alleles increasing the car¬ 
cass length for two out of three QTLs. All these 
effects were responsible for approximately 
10.6% of the variation. The halothane-sensitive 
n allele from RYR1 is known to positively influ¬ 
ence carcass length and dressing percentage, 
while the RN~ allele from PRKAG3 positively 
influences carcass length (Sellier, 1998; Le 
Roy et a/., 2000). 

QTLs significantly associated with dress¬ 
ing percentage were identified on all chromo¬ 
somes with the exception of SSC5, 9, 11, 12 
and 17. For example, QTLs with significant 
effects on dressing percentage were detected 
on SSC1 in a D x P cross (Edwards et ai, 
2008b), SSC4 in a B x Y cross (Malek et a/., 
2001b), and in the same region of SSC7 in M 
x LW (Rohrer and Keele, 1998b) and D x P 
crosses (Edwards et a/., 2008b). Additional 
QTLs have also been suggested by Rohrer 
and Keele (1998b) on SSC3, by Malek et al. 
(2001b) on SSC7, 8, 13 and 14, and by 
Milan et al. (2002) in a different region of 
SSC4. The effects of Large White alleles were 
always favourable compared with Meishan 
alleles, except for the QTL from SSC8, where 
the effects were unfavourable compared 
with Berkshire alleles. Significant dominance 
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(SSC7 and 8) and even overdominance 
(SSC13 and 14) effects were observed in the 
B x Y cross (Malek et al., 2001b), but not in 
the M x LW crosses. The QTLs explain in 
most cases a relatively small fraction (< 8%) of 
the phenotypic variation. 

Significant QTLs that affect carcass yield 
were mapped on SSC4, 7, 10 and 12 (Sato 
et al., 2003; Thomsen et al., 2004; 
Harmegnies et al., 2006). A consistent QTL 
that influenced variation in carcass yield was 
located on SSC7 in B x Y crosses (Thomsen 
et al., 2004) and in a commercial four-way 
cross that involved Large White, Landrace and 
Pietrain lines. 

Most of the chromosomes were detected 
to harbour QTLs that have significant or sug¬ 
gestive effects on the relative proportions of 
carcass lean and fat tissues. Information from 
the literature indicates that SSC2, 4, 8, 9, 10 
and 14 are highly associated with lean and fat 
tissue growth (e.g. Andersson et al., 1994; 
Malek et al., 2001a,b; Geldermann et al., 
2003). QTLs associated with traits such as 
loin, ham and shoulder weight or percentage, 
with or without fat, were identified on several 
chromosomes and have an important eco¬ 
nomic value. A QTL associated with entire 
ham weight was identified at the proximal end 
of SSC8 in Wild Boar x Pietrain (WB x P; 
Beeckmann et al., 2003) and Pietrain x a 
commercial cross dam F 2 crosses (Duthie et al., 
2008). A QTL associated with loin weight 
without external fat was identified at the proxi¬ 
mal end of SSC2 in Wild Boar x Large White 
(WB x LW; Andersson-Eklund et al., 1998), 
Wild boar x Meishan (WB x M; Geldermann 
et al., 2003) and Pietrain x commercial cross 
dams (P x CoL; Duthie et al., 2008). In the 
same very proximal end of SSC2, paternally 
expressed QTLs with significant effects on 
muscle growth of important carcass cuts such 
as loin, ham and shoulder were identified in 
WB x LW (Jeon et al., 1999), P x LW (Nezer 
et al., 1999) and P x CoL (Duthie et al., 2008) 
crosses. It has already been shown that an SNP 
located in a conserved CpG island in intron 2 
of IGF2 is the source of these phenotypic dif¬ 
ferences (Van Laere et al., 2003). 

As expected, large variations in QTL 
effects were also observed between experi¬ 
ments. For example, the QTL located on 


SSC2 in the IGF2 region explained 5-6%, 
3-7%, 15-30% and 30% of the phenotypic 
variance of carcass lean proportions in M x 
LW, P x CoL, WB x LW and P x LW crosses 
(Jeon et al., 1999; Nezer et al., 1999; 
Bidanel et al., 2001; Duthie et al., 2008). 
QTLs that influence loin muscle area were 
detected on all chromosomes with the excep¬ 
tion of SSC16 and 18. One of the QTLs with 
the largest effects was detected on SSC6 and 
is responsible for 21% of lean eye area vari¬ 
ance in the I x LR cross (Ovilo et al., 2000). 
In the same region, Edwards et al. (2008b) 
identified a QTL responsible for approxi¬ 
mately 5% of the phenotypic variation of loin 
muscle area in a D x P cross. 


Meat quality QTLs 

Quality of meat is influenced by several impor¬ 
tant characteristics, including general appear¬ 
ance, colour, taste, marbling, texture and 
tenderness. QTLs affecting the variation of 
these complex quantitative traits were identi¬ 
fied in several regions of most chromosomes. 
There are several overlapping QTLs associated 
with colour scores. For example, the position 
of a QTL on SSC14 associated with lightness 
values (Hunter L*) in the D x LR cross (Rohrer 
et al., 2006) coincides with a QTL for Hunter 
L* reflectance in the B x Y cross (Malek et al., 
2001b) and the colour score in an F 2 cross 
between Meishan and commercial pigs (M x 
CoL; de Koning et al., 2001). A significant 
QTL detected on SSC1 for Hunter L* by de 
Koning et al. (2001) was also reported by 
Rohrer et al. (2006). 

One of the major QTLs affecting pH u and 
colour was initially identified on SSC15 in the 
B x Y cross (Malek et al., 2001b). This QTL 
explains 4-6% of pH u variance, and the 
Berkshire allele, partly recessive, is increasing 
the trait. The QTL, which also affects muscle 
glycolytic potential, is localized in the same 
region as the RN locus. The RN~ allele 
uncovered by Milan et al. (2000) as an SNP 
(p.Arg200Gln) located in codon 200 of the 
PRKAG3 gene was absent in the population 
studied. Additional polymorphisms in PRKAG3 
(p.Thr30Asn, p.Gly52Ser, p.Ilel99Val) were 
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identified as the source of the variation 
(Ciobanu et a/., 2001). An association analysis 
of the PRKAG3 haplotypes in commercial 
populations revealed that differences in pH 
between the four haplotypes may be as high as 
0.1 pH units in all lines analysed, with the 
exception of Berkshires where the differences 
may exceed 0.2 units (Rothschild et a/., 2004). 
The results also showed that pIlel99Val poly¬ 
morphism was associated with the largest 
effects. A suggestive QTL associated with 
change in pH was identified in the same region 
of SSC15 in a D x LR cross (Rohrer et a/., 
2006). 

A suggestive QTL for tenderness score 
and Star Probe Force (a measure of tender¬ 
ness) in the B x Y cross (Malek et ai, 2001b) 
and SSF in the D x LR cross (Rohrer et at ., 
2006) was identified at about 20 cM away from 
the QTL located on SSC15 and associated 
with pH u and colour scores. One of the most 
interesting and studied QTLs associated with 
tenderness was revealed on SSC2. Suggestive 
QTLs affecting raw firmness scores, average 
Instron force, tenderness, juiciness and chewi¬ 
ness on cooked meat were mapped on SSC2 
in the B x Y cross (Malek et a/., 2001b). 
Subsequent studies mapped QTLs associated 
with tenderness in the same regions of SSC2 
in the B x D (Stearns et a/., 2005b) and D x 
LR (Rohrer et a/., 2006) crosses. Ciobanu 
et al. (2004) identified CAST as the main can¬ 
didate for the QTL and provided the first 
genetic evidence that CAST is not only a 
potential functional candidate gene but also 
harbours genetic variants that influence 
tenderness. 

Marbling represents one of the important 
appearance factors used by consumers to per¬ 
ceive quality, and significantly influences pur¬ 
chasing decision (Resurreccion, 2004). 
Significant QTLs for the subjective measure¬ 
ments of marbling were detected on SSC1, 
2, 5, 6, 7, 8, 10, 12, 13, 14 and 17. QTLs for 
amounts of intramuscular fat determined by 
objectives methods were identified in most of 
the chromosomes with the exception of SSC3, 
10, 11, 14 and 16. Chromosomal regions 
that harbour QTLs for marbling and intramus¬ 
cular fat in different populations are located 
on SSC1, 2, 5, 6, 8, 13 and 17. A QTL for 
intramuscular fat or marbling was identified 


near the heart fatty-acid binding protein 
(H-FABP) gene on SSC6, in a commercial 
population (Grindflek et a/., 2001) and in I x 
LR (Ovilo et a/., 2000, 2002) and D x P 
crosses (Edwards et ai, 2008b). This QTL 
seems to explain 15-20% of the variation in 
intramuscular fat, with Duroc and Iberian alle¬ 
les increasing the trait. Previously, Gerbens 
et ai (1999, 2000) used the candidate gene 
approach and found significant associations 
between polymorphisms in the H-FABP gene 
and intramuscular fat in Duroc pigs. Two 
potential candidate genes, H-FABP and small 
heterodimer partner (SHP), were evaluated by 
Arnyasi et al. (2006) as potential candidates 
for the QTL. Significant associations were 
observed between H-FABP polymorphisms 
and intramuscular fat, which explained 
30-35% of the variation and confirmed the 
Gerbens et al. (1999) study. However, other 
studies failed to show a clear association 
between H-FABP alleles and intramuscular fat 
(Nechtelberger et a/., 2001; Ovilo et a/., 
2002; Urban et a/., 2002). 

A QTL with significant effects on mar¬ 
bling was detected on SSC1 (near MC4R) in 
a B x Y cross (Malek et a /., 2001b). Yorkshire 
alleles have favourable effects compared with 
Berkshire alleles, but this QTL explains only 
3-4% of the phenotypic variance of intra¬ 
muscular fat. QTLs associated with intramus¬ 
cular fat were detected on the same 
chromosomal arm but closer to the proximal 
end in D x LR (Rohrer et a/., 2006) and 
Duroc x Large White (D x LW; Sanchez 
et al ., 2007) crosses. The Duroc allele in D x 
LR (Rohrer et a/., 2006) and the Large While 
allele (Sanchez et a/., 2007) increase intra¬ 
muscular fat. A significant QTL associated 
with intramuscular fat was uncovered on 
SSCX in M x LW (Harlizius et a/., 2000) and 
Duroc x Erhualian (D x E; Ma et ai, 2009) 
crosses. Alleles inherited from Meishan and 
Erhualian breeds increase intramuscular fat. 
Harlizius et al. (2000) found the estimated 
QTL effects to be smaller in females than in 
males, probably as a result of random inacti¬ 
vation of the SSCX in females. The long- 
chain acyl-CoA synthetase family member 4 
(ACSL4) gene was considered as a potential 
positional candidate for the QTL detected on 
SSCX (Mercade et al., 2006). 
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The fatty acid composition is known to 
affect nutritional, technical and sensory quali¬ 
ties of fresh and processed pork product 
(Lawrence and Fowler, 1997). QTLs influenc¬ 
ing fatty acid composition in the latissimus 
dorsi muscle and perirenal, abdominal and 
backfat have been identified in several experi¬ 
mental populations (Perez-Enciso et a/., 
2000; Clop et a/., 2003; Lee et a/., 2003; 
Kim et a/., 2006; Nii et a/., 2006; Sanchez 
et a/., 2007; Guo et a/., 2009; Uemoto 
et al., 2009). For example, Clop et a/. (2003) 
uncovered significant QTLs for fatty acid 
composition on SSC4, 6, 8, 10 and 12 in an 
I x LR intercross. The QTL located at SSC8 
had significant effects on palmitic and palmi- 
toleic fatty acids, and a suggestive effect on 
oleic fatty acid. A missense mutation in a con¬ 
served site of the MTTP gene was shown to 
be strongly associated with protein activity 
and fatty acid profile in pigs. A QTL associ¬ 
ated with fatty acid composition has been 
uncovered in an I x LR cross (Munoz et a/., 
2007) and in a Duroc commercial line 
(Quintanilla et a/., 2007). Two linked SNPs in 
acetyl-coenzyme A carboxylase a (ACAC A), a 
positional candidate gene for the discovered 
QTL, were significantly associated with carcass 
leanness, intramuscular fat, and monounsatur- 
ated, saturated (myristic, palmitic and stearic) 
and polyunsaturated (linoleic) fatty acids in the 
longissimus thoracis and longissimus lumborum 
muscles (Gallardo et a/., 2009). 

Boar taint QTLs 

Boar taint is an unpleasant odour and flavour 
of the pork from intact male pigs, primarily 
caused by high levels of androstenone and 
skatole in adipose tissue (Robic et a/., 2008). 
It has been suggested that a major gene that 
controls the level of androstenone in fat is 
segregating in Large White populations 
(Fouilloux et a/., 1997), and that another 
gene influences the skatole level (Lundstrom 
et a/., 1994). Quintanilla et al. (2003) per¬ 
formed a QTL analysis of the androstenone 
levels in fat in a M x LW F 2 cross at 100, 120, 
140 and 160 days old, and on individual boars 
slaughtered at approximately 80 kg live 
weight. There were four robust QTLs detected 


on SSC3, 7 and 14 that individually explained 
up to 15% of the phenotypic variance at 
different ages. This study indicated that the 
variation of this trait is complex and affected 
by numerous genes. A population of a similar 
genetic background was analysed by Lee 
et al. (2005) and QTLs for the laboratory 
measurements of androstenone level in fat 
measured at slaughter (85 kg live weight) were 
mapped on SSC2, 4, 6, 7 and 9. These five 
significant QTLs explained 6.2-8.6% of 
the phenotypic variance. In the same region 
of SSC6, a QTL for boar taint flavour assessed 
by a sensory panel was located. QTLs were 
also detected on SSC14 for the laboratory 
estimates of the level of indole and skatole, 
sensory panel scores for skatole and the 
scores for boar taint flavour in lean and in fat. 
In both studies, with the exception of SSC7, 
Meishan QTL alleles were associated with a 
high level of androstenone in backfat, indicat¬ 
ing an important breed effect on boar taint 
phenotype. 

Varona et al. (2005) conducted the third 
mapping study of QTLs that affect variation in 
boar taint. Androstenone and skatole levels in 
fat were determined in 217 intact Landrace 
boars, and QTL mapping was performed on 
ten regions of the genome chosen on the basis 
of previously detected QTLs for growth and 
fatness. Boar taint QTLs have been mapped 
on SSC3, 4, 6 and 7. A QTL for skatole level 
was mapped on SSC6 in a region where a 
QTL had been previously detected for andro¬ 
stenone (Lee et al., 2005). 

An association study that involved 275 
SNPs in 121 genes and more than 2800 indi¬ 
viduals from Duroc and Landrace boars 
revealed genes with significant SNPs and hap- 
lotypes associated with compounds related to 
boar taint (Moe et al., 2009). These genes 
include several cytochrome P450 members 
(CYP2E1, CYP21, CYP2D6 and CYP2C49), 

nuclear receptor NGFIB and catenin delta 
(CTNND1). Recently Grindflek et al. (2010) 
compared gene expression levels in testicle 
samples of 192 Duroc and Landrace boars 
with extreme levels of androstenone in fat. Ten 
out of the 15 candidate genes analysed were 
significantly up-regulated in high androstenone 
boars from both investigated breeds. 
Associations between SNPs from differentially 
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expressed genes and androstenone level were 
observed only for cytochrome b (5CVB5A), 
suggesting a mechanism of cis-regulation that 
modulates the expression of this gene. 


Expression QTLs 

Variation in gene expression contributes to 
phenotypic diversity and has an impact on vari¬ 
ation of high-order phenotypes, including meat 
quality traits. The advent of high-throughput 
microarray technology in combination with 
linkage analysis has allowed simultaneous map¬ 
ping of expression QTLs (eQTLs) that poten¬ 
tially control the expression of thousands of 
transcripts. Transcript abundance is an inter¬ 
mediate phenotype between DNA sequence 
variation and complex phenotypes. Gene 
expression traits are also affected by complex 
regulation and environmental noise, but they 
are considerably closer to the molecular level 
than the phenotypes they modulate. 

Lobjois et al. (2008) analysed gene 
expression in 17F 2 individuals derived from a 
commercial cross that displayed a range of 
Warner-Bratzler Shear Force (WBSF) meas¬ 
urements of tenderness. From 63 genes with 
expression associated with WBSF, 22 were 
successfully mapped in the pig genome and 12 
were located in the areas previously reported 
to harbour QTLs associated with tenderness 
(SSC2, 6 and 13). The first comprehensive 
genome-wide expression QTL mapping in a 
livestock species was recently reported by Ernst 
et al. (2010) in pigs. Expression profiles from 
loin muscle tissue of 176 F 2 D x P pigs were 
combined with linkage analysis of 124 micro¬ 
satellite markers. Physical localization of a pre¬ 
liminary genome assembly was achieved for 
13,611 oligonucleotides out of 20,400 oligo¬ 
nucleotides present in the array, and for 77 of 
the 124 microsatellite markers. There were 62 
unique eQTL detected, including 24 that 
mapped at the location of the source gene (cis 
eQTL). Comparison of the eQTL position with 
a set of 173 putative QTLs previously reported 
for growth and carcass traits uncovered 11 
common linkage regions, including seven 
regions involving putative cis-acting eQTLs. 
For example, the location of a QTL for loin 


muscle area coincided with an eQTL that mod¬ 
ulates dynein, light chain, Tctex-type 1 
(DYNLT1) expression. As shown in this study 
and others (e.g. Chen et al ., 2008; Farber 
et a/., 2009; Ciobanu et a/., 2010), differ¬ 
ences in gene expression can be used in reverse 
genetic studies to generate well-defined hypoth¬ 
eses regarding downstream effects on molecu¬ 
lar, cellular and functional networks, and finally 
at the phenotype level. 


Genetic Traceability 

Traceability is the process of identification of 
animals and animal products through the steps 
within the food chain from farm to the retailer. 
Conventional traceability consists of a labelling 
system and the management of processed food 
in batches (Dalvit et a/., 2007). This approach 
can trace the provenance of meat products and 
is mandatory in the USA and in all EU member 
countries. Genetic traceability is an approach 
for the identification of animals and their prod¬ 
ucts based on variation at the DNA level 
(Mackie et a/., 1999). However, the cost of 
DNA testing means that its practical applica¬ 
tion, at least currently, is in the verification of 
traceability systems. In the case of swine, this 
technology can be applied for identification of 
a particular animal or breed using deterministic 
and probabilistic approaches. 

Individual identification, or tracking back 
to an individual animal or to the system it has 
been produced in, helps to reinforce the safety 
of products for consumption and assist actions 
associated with the protection of the health of 
human and animal populations in the case of 
disease outbreaks. Typing highly polymorphic 
DNA markers, such as microsatellites, was 
proposed as a viable solution owing to its pre¬ 
cision and durability (Dalvit et a/., 2007). 
Match probability (MP) was used to evaluate 
the efficacy of a panel of markers. A MP esti¬ 
mate is the probability of finding, by chance, 
two individuals that share the same genotypes 
at the analysed loci (Weir, 1996). Goffaux 
et al. (2005) suggested that a panel of 21 
SNPs would give an MP of 7 x 10 -9 , which is 
sufficiently significant for the size of the pig 
population of Belgium (7 x 10 6 ). Obviously, 
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it becomes increasingly difficult to trace indi¬ 
vidual animal products in mixtures such as 
ground meat; Shackell et al. (2005) showed 
that DNA microsatellites may be useful for the 
application of DNA traceability of ground beef 
mixtures prepared from fewer than ten indi¬ 
viduals, but less accurate in mixtures from 
larger numbers. 

Individual traceability is an approach that 
supports food safety, but it can also be used to 
verify production claims. An example of this 
is the use of specific genotypes for the pro¬ 
duction of valuable products, as exemplified 
by the products of the Iberian pig (in Spain) 
and the Berkshire (in Japan and the USA). 
These products have been differentiated for 
their special sensory properties, and genetic 
tests are available to certify their origin (Dalvit 
et a/., 2007). In this case, breed traceability 
can be used to certify food products of higher 
value and quality from certain breeds. 
A genetic test that involved 25 microsatellites 
revealed that up to 20% of Iberian ham sam¬ 
ples, which cost up to ten times more than 
‘normal’ ham, have a genetic composition 
incompatible with the present Spanish 
legislation - either because the Duroc genome 
was present in a percentage greater than that 
permitted (>50%), or because of the signifi¬ 
cant presence (>25%) of white-coated pig 
genomes (Garcia et a/., 2006). 

One of the main distinguishing character¬ 
istics of swine breeds is coat colour and pat¬ 
tern (see also Chapter 3). Much of the variation 
in colour is explained by allelic variants of the 
MC1R and KIT genes (Kijas et al., 1998; 
Marklund et al., 1998; Kijas and Andersson, 
2001). Initial sequence analysis of MC1R from 
different breeds revealed four polymorphisms 
corresponding to five different E alleles, with 
one of the alleles, E p , being a composite of 
two of the polymorphisms (Kijas et al., 1998). 
Additional alleles have subsequently being 
describer (see Chapter 3). A combined dupli¬ 
cation and splice mutation of the KIT I allele 
causes the dominant white-coat colour in 
breeds such as Large White and Landrace. 
The presence of the duplication and the 
absence of the splice polymorphism of the 
KIT I p allele is associated with spotted coat col¬ 
our in the Pietrain breed. In addition, three 
KIT intronic polymorphisms have been used 


to differentiate Berkshire from Tamworth 
(Alderson and Plastow, 2004), or the Berkshire 
breed from other populations in Japan (Carrion 
et al., 2003). 

While the recent development in DNA 
sequencing, polymorphism discovery and high- 
throughput genotyping has underlined the 
potential of molecular genetics in traceability 
applications, its current use is limited because 
of the management involved in collecting and 
storing individual samples and the cost of 
genotyping. 


Breed Variation and Combinability 

Breed differences in carcass composition, 

meat and fat quality traits 

The genetic background of the breed has a sig¬ 
nificant effect on the variation of most carcass 
and meat quality traits known to be influenced 
by a large number of genes. There are com¬ 
mercial lines marketed as ‘meat quality lines’, 
but significant variation in meat quality traits 
exists within these (Gil et al., 2003; Plastow 
et al., 2005). Comprehensive reviews of the 
numerous studies that compared fat and car¬ 
cass quality traits among pure breeds/lines and 
various crosses were published in several reports 
in the 1980s and 1990s (Sellier, 1983, 1988, 
1998; Sutherland et al., 1985; Schworer 
et al., 1989; Wood and Enser, 1989; Sellier 
and Monin, 1994). Since then, genetic progress, 
adjustments in the selection criteria and the 
introduction of molecular technologies have 
generated changes in the genetic background 
and variation of certain traits in swine popula¬ 
tions. Important contributions to the differences 
among commercial lines are attributed to the 
frequencies of alleles from certain loci responsi¬ 
ble for major effects on carcass composition, 
meat and fat quality traits. RYR1 (HAL) or 
PRKAG3 ( RN) are classical examples of genes 
that harbour alleles with strong effects that drive 
important variation in many economically 
important traits. In early studies of the HAL 
locus (Gueblez et al., 1995; Hanset et al., 
1995) a significant difference in allelic fre¬ 
quency of the stress-resistant N allele in Pietrain 
(q n >0.90) and Large White (q n <0.10) was 
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determined to be the main source of the signifi¬ 
cant difference in carcass (dressing and lean 
percentage, carcass length) and meat quality 
(pH,) traits. The non-conserved R200Q substi¬ 
tution (RN~/rn + ) in the PRKAG3 gene, which 
was discovered by Milan et al. (2000), explains 
a 70% increase in muscle glycogen in RN~ 
homozygous and heterozygous animals. The 
high frequency of the RN~ mutation in US 
Hampshire pigs (p RN - = 0.63; Miller et al., 
2000) compared with Yorkshires, where the 
RN~ allele is absent, results in lower muscle 
pH u , reduced water-holding capacity and a 
much lower yield of a cured cooked ham prod¬ 
uct in Hampshire pigs (Monin and Sellier, 1985; 
LeRoy et al., 1990). 

The discovery of functional mutations and 
the introduction of genetic tests that differenti¬ 
ate all genotypes have reduced the influence of 
these two major loci on the variation of meat 
and carcass quality traits: the RN~ allele has 
been eliminated from most of the Hampshire 
populations, while the stress-susceptible n 
allele has a very limited use as a result of the 
advantages in leanness and other meat quality 
traits of the heterozygotes (Nn). In contrast, 
the new favourable PRKAG3 Ilel99 allele is 
prevalent in all common commercial breeds, 
and the high frequency of this allele in Berkshire 
(0.87) compared with other lines (Duroc, 0.38; 
Large White, 0.22; Landrace, 0.14) is one of 
the genetic sources that leads to a higher pH u 
and better colour scores of ham and loin in the 
Berkshire (Ciobanu et al., 2001). 

In recent years, there have been several 
comparison studies of the carcass composi¬ 
tion, meat and fat quality traits between com¬ 
mercial lines with different genetic backgrounds. 
In the study of Gispert et al. (2007), carcass 
composition was analysed and compared 
between Large White, Landrace, Duroc, 
Pietrain and a Meishan synthetic developed 
from a cross with a Large White-based line. 
The first four lines represent a significant pro¬ 
portion of North American and European pig 
production (Gil et al., 2008). Carcasses from 
the Pietrain line (Halothane negative) had the 
highest dressing percentage (83.3%) and were 
the shortest (81.8 cm). The Pietrain line was 
associated with the highest proportion of ham 
(270.9gAg) and the lowest proportion of belly 
(98.0 g/kg). The Pietrains were the leanest 


overall (755.9 g/kg), and the leanest in all of 
the dissected cuts compared with the Meishan, 
which was the fattest (638.5 g/kg) and had the 
highest intermuscular fat. The Duroc had a 
relatively high level of intramuscular fat and 
intermediate carcass quality but, in the study of 
Guerrero et al. (1996), it showed the best 
characteristics for dry-cured hams compared 
with Pietrain, Belgian Landrace and Large 
White x Landrace (LW x LR) crosses. 

Gil et al. (2008) characterized five com¬ 
mercial lines for meat quality parameters and 
muscle biochemical characteristics of the long- 
issimus thoracis (LT), and semimembranosus 
(SM) muscles. The muscle fibre size was the 
main difference in the LT. The largest differ¬ 
ence was found between Meishan and Pietrain 
lines, with the Pietrain muscle fibre size the 
largest. The Duroc line was differentiated from 
the rest of the breeds by muscle oxidative 
traits, and the Landrace line was differentiated 
by the high percentage of fast glycolytic fibres. 
Duroc and Pietrain were different from 
Landrace and Meishan according to the meta¬ 
bolic and contractile characteristics of the SM. 
The measured muscle characteristics were 
associated with differences in drip loss and 
marbling, and could thereby influence the eat¬ 
ing quality of pork. 

Brewer et al. (2002) analysed quality 
characteristics of pork derived from pigs of 
the following backgrounds: Duroc and Pietrain 
(Halothane negative, NN), Pietrain (Halothane 
positive, nn), Berkshire and Hampshire ( rn + ), 
Hampshire (RN~) and a synthetic line. 
A trained panel evaluated the visual appear¬ 
ance of uncooked lean and fat, and the flavour 
and texture of cooked chops. Colour scores 
(Hunter L*, a* and b* values), hue angle, cook¬ 
ing loss and Warner-Bratzler shear force were 
also measured. The genetic background 
affected the visual colour differences between 
lines, with the chops from Duroc, Berkshire 
and Pietrain (nn) being the least pink. 
Berkshire chops appeared to have the most 
marbling in the lean, and those from Pietrain 
(nn) appeared to have the least marbling. 
Chops derived from Pietrain (nn) and 
Hampshire (rn + rn + ) had the highest a* values, 
while those from Duroc, Pietrain (NN) and 
Hampshire (RN~) had the lowest. Chops from 
Duroc carcasses were associated with the 
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lowest cooking loss (18.8%) compared with 
Hampshire (rn + rn + , 21.6%) and Pietrain (NN, 
20.0%). The highest shear force was observed 
for chops from Pietrain (nn, 6.7kg) and 
Hampshire (rn + rn + , 6.0kg) carcasses. Pork 
derived from Hampshire (RN~) was associated 
with the highest juiciness, followed by those 
from Hampshire (rn + rn + ), Pietrain (NN), 
Berkshire and Duroc. 

In the study by Ciobanu et al. (2001) of 
the influence of PRKAG3 alleles on meat 
quality traits in five commercial lines, Berkshire 
was associated with the highest loin pH u 
(5.83) and ham pH u (5.81) and the lowest 
ham Minolta b* (3.46) and Minolta L* (43.12) 
(colour/reflectance scores). Duroc was associ¬ 
ated with the lowest drip percentage (0.14%), 
while Large White had the highest (2.32%). 
Plastow et al. (2005) studied variability for 
meat and fat quality traits, within and between 
five commercial breeds, and found Duroc to 
be associated with the lowest drip loss (2.49%), 
Minolta b* (3.44) and L* score (46.15) and, 
together with Pietrain (NN), the highest pH u 
(5.64). The highest proportion of intramuscu¬ 
lar fat in LT was found in Meishan (1.8%) and 
Duroc (1.9%) lines, followed by Pietrain 
(1.2%, the overall leanest line), Landrace 
(1.1%) and Large White (1.0%). Regarding 
the fatty acids, the percentage of stearic acid 
was higher in the Duroc line (13.4%) than in 
the rest of the lines, with Pietrain having the 
lowest (11.9%). The percentage of stearic 
acid was relatively low in Meishan (12.17%), 
even if the Meishan LT was associated with 
highest marbling. Interestingly, the abundance 
of linoleic acid was significantly lower in Duroc 
and Meishan than in Landrace, Large White 
and Pietrain lines. The report suggests that 
the metabolism of these C18 acids was differ¬ 
ent between lines, which could be significantly 
influenced by genetics. 


Heterosis and maternal effects 
in breed crosses 

Crossbreeding is used almost exclusively in 
the pig industry, and a better understanding of 
the best combinations of genetic backgrounds 


to achieve heterosis for a particular trait is 
critical in implementing optimal crossbreed¬ 
ing schemes (Sellier, 1998). It is well estab¬ 
lished that heterosis has a limited effect on 
carcass traits, as exemplified by the F x crosses 
between breeds that display important differ¬ 
ences for this group of traits, such as Large 
White (or Yorkshire) and Meishan (Poilvet 
et al ., 1990; Serra et al., 1992; Lan et al., 
1993). The direct and maternal heterosis 
effects on meat quality traits are also relatively 
small, and are described in several reports 
that employed most breed combinations and/ 
or most meat and eating quality traits (e.g. 
Young et al., 1976; Schneider et al., 1982; 
McLaren et al., 1987; Lo et al., 1992; Serra 
et al., 1992; Bidanel et al., 1993; Lan et al., 
1993; Ellis et al., 1995). Previous studies 
showed heterosis effects on pH l5 pH u and 
related traits in crosses informative at the RN 
and HAL loci, summarized by Sellier (1998). 
As discussed above, the RN~ allele has been 
eliminated from most of the Hampshire popu¬ 
lations, while the stress-susceptible n allele is 
maintained at low frequency in certain popu¬ 
lations owing to advantages in leanness and 
other meat quality traits in heterozygotes 
(Nn). For example, the crosses that involve 
the ‘stress-sensitive’ Pietrain showed favour¬ 
able heterosis effects on pH 1} with the F : 
hybrids having a phenotype closer to the 
‘stress-resistant’ parent (Sellier, 1987). 
Significant direct heterosis effects (-20% to 
-13% of the parental mean) were reported 
on muscle lipid content in the Duroc x 
Landrace cross (Wood et al., 1987; Lo et al., 
1992). Heterosis was also observed for intra¬ 
muscular fat in females of the crosses that 
involved Large White and Meishan (Serra 
et al., 1992) or for Large White and Meishan 
x Large White (Poilvet et al., 1990). 

The genetic correlation between the per¬ 
formance in a purebred Genetic Nucleus envi¬ 
ronment and in a crossbred-commercial 
environment is less than one, owing to different 
degrees of sensitivity. For example, the esti¬ 
mates for backfat range from 0.21 to 0.88, 
while those for growth vary from 0.19 to 0.99 
(Brandt and Taubert, 1998; Lutaaya et al., 
2001). As a result, crossbred performance has 
begun to be used in combination with purebred 
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performance in an index to estimate the sensi¬ 
tivity of the genetics inherited from a boar 
within a range of environments. Combined 
crossbred and purebred selection approaches 
have been reported to be superior in certain 
situations to pure selection alone (Bijma and 
van Arendonk, 1998; Lutaaya et a/., 2001; 
Perez et a/., 2006). For example, the incorpo¬ 
ration of and emphasis on commercial/cross- 
bred records can increase the accuracy for 
breeding for pH by 0.04 units (A.A. Sosnicki, 
personal communication). 


Future Directions in Breeding on 
Carcass and Meat Traits 

Significant emphasis on production efficiency 
remains a priority in pork production. This 
emphasis is critical in the face of demand for 
the responsible use of natural resources in food 
production. Advances in quantity of produc¬ 
tion and efficiency of production should not 
come at the expense of processing quality, 
sensory quality or nutritive quality. Several 
major changes witnessed lately in the food 
industry, consumer choices and the recent rev¬ 
olution of molecular technologies could have 
an important impact on the future role of 
genetics in meat quality improvement. Industry 
consolidation in the last few years has led to 
the integration and coordination of major sec¬ 
tors of food production systems from animal 
breeding to packing and retail. Consumer 
demand for high-quality products has led to an 
important shift from ‘commodity pork’ to meat 
product differentiation and ‘value-added pork’, 
which is superior for eating quality characteris¬ 
tics such as tenderness, juiciness and flavour. 
Future emphasis on the quality of other pork 
cuts will provide important information regard¬ 
ing the use of valuable products such as ham 
and bacon. 

A number of single DNA tests have 
become available to breeders for use in the 
past few years. While currently there is no 
premium associated with higher quality car¬ 
casses, some of these markers are used 
intensely in the genetic improvement of lines 
dedicated for the sector of the industry that 


manages both pig production and meat pack¬ 
ing activities. The main traits targeted by 
genetic marker improvement are drip loss and 
meat pH. More excitingly, rapid progress has 
been achieved in the development of more 
efficient molecular technologies for high- 
throughput genotyping, DNA sequencing and 
transcriptome profiling. At the end of 2008, 
an International Consortium and Illumina, 
Inc. launched the PorcineSNP60 BeadChip, 
which contains 62,163 SNP assays that uni¬ 
formly cover the entire swine genome (Ramos 
et a/., 2009b). This SNP chip provides 
improved power to map and dissect QTLs 
responsible for phenotypic variation. The 
high-density genotypes generated by this 
array can also be utilized to predict genomic 
breeding value that can then be used in 
genomic selection, a concept introduced by 
Meuwissen et a/. (2001). This represents a 
unique opportunity to integrate molecular 
data with traditional quantitative measure¬ 
ments and modern computational approaches 
to provide accurate genetic predictions for 
selection in livestock. Accurate prediction val¬ 
ues have the potential to dramatically change 
selection in the swine industry and the quality 
of the pork products. Genetic prediction can 
also take advantage of the high-throughput 
sequence data provided by next-generation 
sequencing methods (Meuwissen and Goddard, 
2010). Recent advances in massively parallel 
DNA sequencing have accelerated the pace 
and reduced the cost of genome re-sequencing, 
and provide steady-state quantitative and 
qualitative measurements of transcriptome 
diversity. All of these advancements will have 
an important impact on our ability to under¬ 
stand genome biology and to better character¬ 
ize the components of molecular networks 
that are affected by genetic variations, and to 
induce changes in metabolic traits, including 
those that affect meat quality. 
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Introduction 

Genetic improvement in pigs began several 
centuries ago leading to a clear transformation 
from the wild pig to the domestic pig. The 
process of domestication began about 10,000 
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years ago, according to the archaeological 
data (Chapter 2). This was followed by breed¬ 
ing for specific characters and the develop¬ 
ment of specialized breeds and lines. The 
genetic improvement at that time was based 
on empirical methods or due to ‘unconscious 
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selection’, as termed by Darwin (1859). The 
methods became more scientific after the dis¬ 
covery of Mendel’s principles and the develop¬ 
ment of genetics as a scientific discipline. A 
historical perspective of the early genetic 
improvement methods in pigs was provided by 
Ollivier (1976). 

During the early part of the last century, 
the focus was on breed characteristics and 
physical appearance. This included the main¬ 
tenance of pedigree records, herd books and 
prizes for champion boars and sows. Later, 
starting in the 1950s, genetic improvement 
efforts were diverted towards reduction in 
backfat and improvement in growth rate or 
days to market. However, breed characteris¬ 
tics and traits of physical soundness were still 
of utmost importance. This resulted in remark¬ 
able genetic progress in lean meat production 
and growth rate, while very small or negligible 
gains were made in reproduction traits. 
A more detailed overview of the genetic 
changes in different breeds and traits during 
the past century was given by Merks (2000). 
The majority of these changes were as a result 
of improvements in performance recording 
and genetic evaluation methods. 

Modern pig breeding is evolving as a 
technology-based industry, making use of 
advancements in computing, Internet commu¬ 
nications, biotechnology and molecular biol¬ 
ogy. In this chapter on modern genetic 
improvement, updated from Ollivier (1998), 
the principles of genetic improvement and the 
organization of current genetic improvement 
programmes will first be reviewed. This will be 
followed by discussion of the definition of 
breeding objectives, choice of selection cri¬ 
teria, including traits with economic and non¬ 
economic values, and their use in breeding 
value estimation. Detailed information is then 
provided for the design of breeding pro¬ 
grammes, including the selection of special¬ 
ized sire and dam lines, transfer of genetic 
gains in commercial populations, and adapta¬ 
tion of breeding programmes to meet the 
emerging needs of society. In the final section, 
the assistance that genetic markers and 
genomic selection can provide for further 
enhancing genetic progress will be discussed. 


General Principles of Genetic 

Improvement 

The first objective of a breeding programme is 
to produce the ‘most improvement per unit of 
time’, as stated by Dickerson and Hazel (1944). 
They showed that selection response depends 
on three parameters, which may differ between 
the sexes, i.e. selection accuracy (p, defined as 
the correlation between the aggregate geno¬ 
type and the selection criterion), selection 
intensity (i) or standardized selection differen¬ 
tial, and generation interval (t). The expected 
annual response (R a ), expressed in genetic 
standard deviation units, is: 

K = 0's P, + U Pd)/( f s + 0 (16.1) 

where the subscripts refer to sires (s) and dams 
(d), respectively. This formula can be rewrit¬ 
ten as: 

K = 0's r i Hs + <d r,™) * ° G / (*s + 0 (16-2) 

where r 1H refers to the accuracy of the genetic 
evaluation or the correlation between the 
breeding goal and the observations recorded 
on the animals, and a G is the available genetic 
variation. Genetic gain increases if selection 
intensity increases, if accuracy of estimating 
the genetic value increases or if generation 
interval decreases. Genetic variation is con¬ 
sidered to be a given constant, although it 
might be differentially expressed (e.g. in heat- 
or disease-challenged environments). 

Accuracy of the genetic evaluation 
depends on the availability of observations and 
on the heritability of the trait under scrutiny. 
The main basis of observations is records on 
an animal’s own performance. In addition, 
other sources of information are performance 
records on parents, offspring and other rela¬ 
tives. In practice, a combination of these 
sources exists. The sire can have observations 
on a (large) number of offspring, i.e. full and 
half sibs of the animal, the dam has her own 
(reproduction) records, and the selection candi¬ 
date has its own recording of finishing traits. In 
a basic situation when only own performance 
records are available, the accuracy equals the 
square root of the heritability. In other words, 
if the heritability of a trait is high, then the 
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availability of own performance observations 
will yield a high accuracy and higher response 
to selection. 

The breeding objective should define the 
main goals of genetic improvement. It must 
define a production environment and traits rel¬ 
evant for genetic improvement. The goal is 
then to maximize genetic change over genera¬ 
tions or over time. An important aspect to con¬ 
sider is the dissemination of the genetic changes 
from the nucleus breeding farms down to the 
multiplier and commercial farms (the breeding 
pyramid). This process creates genetic lags 
(Bichard, 1971), which can be minimized by 
acting on the structure of the breeding pyramid 
(using artificial insemination (AI), for instance), 
as well as on the genetic level of the individuals 
migrating from one tier to the next, using the 
gene flow techniques proposed by Elsen and 
Mocquot (1974) and Hill (1974). An economic 
optimization of the whole system may also be 
attempted by introducing cost-benefit consid¬ 
erations at all levels of the breeding pyramid, 
as Elsen and Sellier (1978) did for determining 
an optimal selection policy in dam lines. The 
breeding goal typically includes more than one 
trait in the breeding objective. Multi-trait index 
selection relates traits from the breeding 
objective through economic values to traits 
measured for the genetic evaluation. 

The rate of genetic improvement also 
depends upon the amount of available genetic 
variation (a G ). Maintenance of genetic variation 
and biodiversity is also an important element of 
sustainable animal breeding and reproduction. 
The loss of genetic diversity within a breed is 
related to the rate of inbreeding (dF). Factors 
influencing the maintenance of genetic varia¬ 
tion in quantitative traits in populations under¬ 
going selection were reviewed by Hill (2000). 

Modelling genetic gain and rate of 
inbreeding can be very helpful. Stochastic 
models can take into account the variability 
expected in genetic responses, as well as the 
changes in genetic variances and covariances 
under selection in populations of limited size. 
Simulation models such as those of Belonsky 
and Kennedy (1988) and De Roo (1988) are 
now increasingly relied upon in the study of pig 
breeding schemes. These models simulate 
selection in populations on an individual basis. 
Wray and Thompson (1990) developed the 


theory of genetic contributions, Bijma and 
Woolliams (2000) extended this theory by sup¬ 
plying a set of deterministic formulae predict¬ 
ing rate of inbreeding and genetic progress in 
situations with best linear unbiased prediction 
(BLUP) selection and overlapping generations. 
This set of formulae was then included in the 
software package SelAction (Rutten et al., 
2002), which helps breeders develop total 
merit selection indexes and predict response to 
selection. 

The two disadvantages of inbreeding are 
loss of genetic variation and the increase in the 
risk of homozygosity of deleterious alleles. 
Crossbreeding of genetically different lines 
reduces the chance of homozygosity and results 
in heterosis. Crossbreeding is key in the set-up 
of modern pig production pyramids. It opens 
the opportunity for specialized sire and dam 
lines, and it opens the opportunity for specific 
correction of the shortcomings of a line or 
cross. A given dam cross can be mated with a 
fast-growing sire line to produce heavily mus¬ 
cled lean animals for a low carcass weight mar¬ 
ket or to a slow growing synthetic fatter sire 
line for the production of Parma ham (a type of 
dry-cured ham originating from the Parma 
region of Italy). 


Organization of Genetic Improvement 

Pork value chain 

In most parts of the world, pigs are raised to 
produce pork. The production of pork happens 
through a chain of events starting from the pro¬ 
duction of breeding stock and ending with the 
production of pork products sold in retail out¬ 
lets. The pork value chain includes all players 
involved in pork production, from genetics sup¬ 
pliers and pig producers to the slaughterhouses 
and the further-processing and retail outlets 
that bring the final products to the pork con¬ 
sumers (Fig. 16.1). In addition, transport com¬ 
panies and feed suppliers also contribute to the 
pork value chain. Each link in this chain has its 
specific and important function and the chain is 
as strong as its weakest link. There is also a 
strong interdependence between the value- 
chain partners. Moving forward from genetics 
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Effect of genetic improvement 
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Fig. 16.1. Pork value chain. 


Genetic improvement needs 


to production, the efficiency of production 
depends partly upon the quality of the breeding 
stock. The quality of pork at the processing 
plant depends upon the quality of the pigs from 
the production level and the breeding stock. At 
the same time, there is dependence from the 
consumer side, e.g. if there is a food safety 
issue with pork purchased by the consumer, it 
needs to be traced back to the processing plant 
and the farm of origin until the source of the 
problem is identified. 

In many countries, the partners in the 
pork value chain work together for the com¬ 
petitiveness of their pork industry in the world 
market, while in some other countries, individ¬ 
ual companies take up that role. In Denmark, 
pork production is highly integrated, and 
breeding, production and processing are all 
operated by one organization. In Canada, a 
national pork value chain round table has been 
created as a forum for interaction among pro¬ 
ducers, processors and the government, while 
individual breeders and breeding companies 
operate on their own and interact with their 
value chain partners. In the USA and some 
countries in Europe, there is no such national 
forum of value chain partners, but the multina¬ 
tional companies have an integrated system of 
their own, starting from the suppliers of genet¬ 
ics, to producers and processors. 

Integrated pork value chains allow breed¬ 
ing stock suppliers to understand the future 
needs of their customers and target genetic 
improvement programmes accordingly to pro¬ 
vide the right type of breeding stock at the right 
time for the competitiveness of their respective 
value chains. The value chains, whether national 
or multinational, typically develop their own 
competitive strategies. These include the target 


markets, specific products that will bring them 
more value in the future, and methods to reduce 
costs by the integration of processes and avoid¬ 
ing duplication. In addition to attention to the 
quality of the product, it is important for the 
value chains to develop their product to con¬ 
form with the needs of society, such as food 
safety, animal welfare, animal health and reduc¬ 
tion of impact on the environment. 

Genetics is the first link in the value chain. 
The quality and price of the final pork product 
depend upon the quality of the breeding stock 
and its ability to produce pork with minimum 
input costs for producers and processors. Fork 
value chains that have a strong breeding com¬ 
ponent should therefore be the most successful 
in the world. 


Pig breeding pyramid 

In most countries, pig breeding programmes 
operate in a three-tier pyramidal structure 
(Fig. 16.2). At the peak of the pyramid are 
the nucleus breeding farms that actually gen¬ 
erate the genetic changes, followed by the 
next tier of multiplier farms that carry out 
specific matings or crossbreeding for the pro¬ 
duction of large numbers of females. These 
females are then sold to commercial produ¬ 
cers for piglet production and finishing to 
produce market pigs that are sent to slaugh¬ 
terhouses to produce the pork. In addition, 
there are a number of variations in the tiers of 
the pyramid, such as separate operations for 
piglet production and finishing, piglet produc¬ 
tion with own multiplication, multiplication 
with a nucleus module, etc. The nucleus farms 
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Fig. 16.2. Pig breeding pyramid. 

actually conduct breeding and selection for 
the genetic improvement of specific breeds or 
lines. They target their selection programmes 
to the needs of their customers, the pork pro¬ 
ducers and processors. However, there is a 
time delay between the genetic improvement 
in the nucleus farms and the transfer of genetic 
gains to commercial producers through multi¬ 
pliers. This delay is typically 3-5 years and is 
called genetic lag. It is, therefore, very impor¬ 
tant for the nucleus breeders to evaluate the 
future needs of producers ahead of time and 
decide their breeding goals accordingly. 

The nucleus breeding farms typically 
maintain specific breeds or breeding lines. The 
commonly used breeds are Duroc, Pietrain, 
Landrace and Yorkshire. In addition, breeding 
companies develop their own synthetic lines 
by crossing two or more breeds and further line 
breeding within the crossbreds for several gen¬ 
erations to combine and stabilize the desirable 
characteristics. Furthermore, within a breed, 
specific lines are developed to advance certain 
traits of interest, while keeping up the main 
breed characteristics. Among the purebreds, 
Landrace and Yorkshire (Large White) are typi¬ 
cally selected for sow productivity, while Duroc 
and Pietrain breeds are selected for efficiency 
during the finishing phase, quality of the car¬ 
cass and the resulting pork. Crosses between 


Landrace and Yorkshire (Large White) are used 
by multipliers to produce F : females that are 
then used by piglet producers or by farrow to 
finish operations to produce finishing pigs. 
These breeds and lines derived for production 
of the required females are therefore called 
dam lines. The breeds and lines used for mat¬ 
ing with the Fj_ females to produce the finishing 
pigs are therefore called sire lines. In practice, 
there are many variations of using specific 
breeds as sire or dam lines. In some specific 
cases, Yorkshire/Large White is used as a sire 
line while Pietrain and Duroc are used as dam 
lines. In all these cases, they are selected either 
as sire line or as dam line for several genera¬ 
tions. Genetic improvement within sire or dam 
lines is then targeted towards improvement in 
specific characteristics that are important to the 
producers and consumers. 

Role of breed societies 

The maintenance of herd book and breed char¬ 
acteristics by breed societies has been a long¬ 
time trend in pig breeding and still continues to 
be so. Breed societies are either for a specific 
breed or a number of breeds. These societies 
(e.g. the Canadian Swine Breeders Association) 
define the breed characteristics and maintain 
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pedigree records and the rules of entry of 
animals into the herd book. Breed societies 
often issue certificates of breed purity that are 
especially used for the export of animals. 
Traditionally, breed purity has been mainly 
determined through pedigree records. However, 
some breed societies (e.g. the National Swine 
Registry in the USA) have introduced manda¬ 
tory DNA sample collection that is used for 
breed purity tests in disputable cases. The roles 
of breed societies have evolved over time. 
Increasingly, they have moved to the mainten¬ 
ance of DNA banks, supporting research, edu¬ 
cation of members and promotional activities. 
Many breed societies provide services for per¬ 
formance recording and genetic evaluations, 
either directly or through collaboration with 
other institutions, and play an important role in 
genetic improvement through national improve¬ 
ment programmes in their respective countries. 

Role of breeding companies and national 
improvement programmes 

Increasingly, pig breeding has moved from 
purebred breeding and genetic improvement 
through breed societies to the breeding of spe¬ 
cific lines and associated services by breeding 
companies. In addition to supplying breeding 
stock, breeding companies or professional 
breed societies provide a range of services - 
from breeding and multiplication to production. 
Breeding companies typically can be faster in 
the extensive adoption of new technology, such 
as use of ultrasound and electronic equipment, 
database technology, molecular genetics and 
complicated statistical methods for genetic eval¬ 
uation and selection. Breeding companies have 
transitioned pig breeding to a technology-based 
industry and play an important role in support¬ 
ing research at universities and public institu¬ 
tions (Knap et a/., 2001). 

Many breeding organizations, whether 
private companies or professionalized breed 
societies, operate in more than one country and 
even have nucleus breeding herds in several 
countries. An overview of the relative market 
shares of breeding companies and national pro¬ 
grammes covering several European countries, 
Canada and Australia was given by Brascamp 


(1994). The review suggested that national 
breeding programmes still had an important 
role in many countries at that time, in contrast 
with the poultry situation, for instance, where 
hardly any national involvement existed. During 
the past decade, there has been further consoli¬ 
dation in the pig breeding industry, as in other 
major industries. The number of breeders has 
declined, while the total number of breeding 
pigs has remained relatively steady, and larger 
breeding companies have picked up higher 
market shares. A list of the main breeding com¬ 
panies and breed societies and their relative 
market shares was summarized by the Working 
Group on the ‘FABRE Technology Platform’ in 
their ‘Strategic Research Agenda’ published in 
2008, and is given in Table 16.1. The table 
shows the important role that is played by breed¬ 
ing companies in Europe, followed by those in 
North America, with a diminishing role for 
breed societies and herd book organizations. 

In addition to breeding companies, national 
improvement programmes continue to exist in 
several countries, following the early Danish 
example, where all three tiers of the pyramid 
are integrated into a national programme. In 
such structures, with a dispersed nucleus rather 
than a single company nucleus, animals from 
several breeding farms are evaluated at central 
testing facilities, and the selected animals may 
then be used in the national herd by AI. Various 
routes have been taken in countries that have 
tried to maintain national programmes. The 
extremes are on one side a closed selection sys¬ 
tem, where a breeding company system is, so to 
speak, extended to the whole country, as in 
Denmark for instance, and on the other end an 
open selection system, for example in Canada, 
where genetic evaluation is carried out across 
several breeding organizations while they act 
rather independently in using those evaluations 
for genetic improvement. In that respect, France 
may be an extreme case, where most breeding 
companies are incorporated into the national 
programme. 

Role of reproduction technologies 

The progress in genetic improvement over the 
past years has been strongly supported by 
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Table 16.1. Estimated market shares of different breeding organizations in different countries. Source: 
FABRETP (2008). 


Organization 

Developed countries (%) 

Worldwide (%) 

EU-based organizations 

PIC (= Genus), UK 

18 

10 

TOPIGS, Netherlands 

8 

5 

Danbred, Denmark 

6 

3 

Hypor-Genex, Netherlands 3 

6 

3 

JSR, UK 

3 

1 

Seghers Rattlerow, Belgium-UK (incl. Newsham, USA) 

3 

1 

Herdbooks/Nucleus, France 

3 

1 

ACMC, UK 

2 

1 

Herdbook, Poland 

2 

1 

Herdbooks, Italy 

2 

0.5 

Herdbooks, Germany 

2 

0.5 

BHZP, Germany 

1.5 

0.5 

France Hybrides, France 

1.5 

0.5 

Herdbooks, Eastern EU 

2 

0.5 

Total of EU based organizations 

60 

28.5 

Non-EU-based organizations 

Monsanto, USA b 

5 

2 

Smithfield Genetics, USA 

3 

1 

Genetiporc, Canada 

3 

1 

National Swine Registry, USA 

5 

2 

Canadian National Breeders, Canada 

5 

2 

Total of non-EU based organizations 

21 

8 


a Hendrix Genetics acquired the pig breeding part of Nutreco (Euribrid: Hypor-Genex) in June 2007. 
b Newsham (USA) acquired the pig breeding part of Monsanto in September 2007. 


developments in assisted reproduction tech¬ 
nologies (ART), as described in Chapter 11. 
One of the most used among these is AI, which 
allows easier dissemination of genetics from 
nucleus to commercial farms, with considerably 
lower risk of disease transmission and a smaller 
genetic lag. The use of AI is especially cost- 
effective for transmission of genetics between 
countries, while transport of live animals can be 
problematic. In addition to helping dissemina¬ 
tion of superior genetics over a wider population 
base, AI also helps health control and herd man¬ 
agement, and it improves connectedness for 
more accurate across-herd genetic selection 
(Haley, 1991; Mathur et a/., 2002; Fouilloux 
et a/., 2008). Before 1991, the use of AI 
remained at a low level (Sellier and Rothschild, 
1991); however, within the following decade the 
use of fresh-semen AI fully matured and contrib¬ 
uted significantly to across-herd genetic evalua¬ 
tion and selection in national and multinational 
breeding programmes (Knap et al., 2001). 


Furthermore, new developments in sperm 
sexing technology allow for the production of 
either male or female offspring for sire or dam 
lines, and this can be very useful for the nucleus 
and multiplication tiers. The technique involves 
the separation of spermatozoa carrying either X 
or Y chromosomes (Maxwell et al ., 2004; 
Garner, 2006). The sorted sperm can then be 
used for AI or embryo transfer. Practical applica¬ 
tion of this technology in pigs is still very limited, 
and depends upon the efficiency of the separa¬ 
tion technique and the resulting conception rate. 
Current speed of separation requires the use of 
deep intrauterine insemination techniques, such 
that much lower doses can be applied. 

Another such reproductive technology 
(Chapter 11), embryo transfer (ET), has been 
used in pigs for the last 60 years (Kvasnitsky, 
1950). Since its initial application, further devel¬ 
opment of ET has been hampered by the avail¬ 
ability of methods to effectively cryopreserve 
swine embryos, unlike the situation in cattle 
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(Youngs, 2001). There are two main methods 
of ET, surgical and non-surgical. Surgically 
recovered embryos are typically too old for suc¬ 
cessful non-surgical implantation. The problem 
of non-surgical ET is the difficulty in recovering 
embryos from donor sows and crossing the cer¬ 
vical barrier after the fertile phase has passed. 
Recovery of embryos requires flushing, which is 
technically too demanding in sows, or requires 
slaughter of the donor sows. The procedure also 
actually requires sows that produce a sufficient 
number of viable embryos for ET. More recent 
developments in the technology, especially in 
non-surgical and endoscopic procedures, may 
overcome some of the problems for the future 
wider use of ET in commercial practice 
(Hazeleger and Kemp, 2001; Martinez et a/., 
2004); currently, the technique is mainly used 
for research. If fully successful, there are distinct 
advantages of the technology. Especially, 
embryo transfer allows for transfer of 100% of 
the genome across tiers in the pyramid, while AI 
transfers only 50% of it. In addition, ET has 
reduced transportation cost and there is a lower 
risk of the introduction of diseases, especially 
compared with the use of live animals. In view 
of these considerations, there is high demand 
for use of this technology by multinational 
breeding companies and for research on 
cloning. 

The use of cloning technologies was ini¬ 
tially problematic in pigs. However, a number 
of technical difficulties have been overcome to 
some extent, leading to higher success rates 
(Onishi et a/., 2000; Polejaeva et a/., 2000; 
Betthauser et a/., 2001). Offspring produced 
by the new methods grow normally and are 
fertile (Onishi, 2002; Shibata et a/., 2006). 
Furthermore, the use of parthenogenotes in 
place of embryos to establish pregnancy and 
promote the development of a single co¬ 
transferred embryo offers new opportunities 
(Kawarasaki et a/., 2009). It has been postu¬ 
lated that cloning will allow faster transfer of 
genetically superior animals, even directly to 
commercial producers. In this way, the tech¬ 
nology can offer the possibility of reducing the 
genetic lag between nucleus, multiplication 
and commercial tiers of the breeding pyramid, 
and even eliminate the need for one or two 
tiers (Visscher et a/., 2000; Niemann et ai, 
2003). 


A combination of reproduction technol¬ 
ogies with advancements in molecular genetics 
is particularly expected to deliver a powerful 
impact in increasing the rate of genetic progress 
and to allow for efficient use of genetic varia¬ 
tion and reduce genetic lag. 


Definition of an Overall Breeding 

Objective 

Choice of breeding goals 

An essential step in any genetic improvement 
programme is the definition of the overall 
breeding objective or breeding goal. This 
includes traits of interest, direction of improve¬ 
ment and relative significance of each trait. 
Initially, the main breeding objective of breed 
societies was to maintain breed characteristics, 
and selection was mainly based on exterior 
traits. Over the years, breeding organizations 
have been focusing their breeding goals on 
lowering the cost price of pork production and 
the quality of pork in terms of processing and 
consumer appreciation. It is important to note 
that the breeding objective is actually imple¬ 
mented at the nucleus level, but it is targeted to 
the production level and to future markets. 

In commercial breeding, breeding goals 
are defined separately for sire and dam lines. 
The sire lines are mainly selected for genetic 
improvement in production traits such as 
growth rate, carcass quality, meat quality, feed 
conversion and conformation. The breeding 
goal for dam lines typically includes the same 
production traits, but in addition female repro¬ 
ductive traits, such as litter size, piglet survival, 
age at puberty, conception rate, number 
weaned and weaning to oestrus interval. In 
more detail, traits like farrowing survival, lacta¬ 
tion survival, number of teats, litter weight, ges¬ 
tation length, longevity, age at first service, 
number stillborn and litter mortality may also be 
considered in the breeding goal for dam lines. It 
is important to note that genetic progress in 
individual traits is inversely related to the number 
of traits included in the breeding goal, and that 
negative or positive genetic correlations 
between traits exist. Therefore, it is important 
to choose traits with great care. 
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The choice of breeding goals and success¬ 
ful breeding have led to the genetic improve¬ 
ment of economically important traits such as 
daily gain, backfat thickness, feed efficiency 
and litter size, especially during the last decade. 
However, this is not enough for the future. 
Breeding goals have been, or are presently 
being, set up more broadly to include traits that 
are important to society. There is increasing 
societal pressure and desire from several pig 
producers to include traits such as the safety 
and improved quality of pork, the health and 
welfare of pigs, and the environmental impact 
of pig production (Verbeke and Viaene, 2000; 
Kanis et al ., 2005). Further, the health of the 
pigs is becoming more important owing to the 
concentration and increasing scale of pig pro¬ 
duction. This requires strict biosecurity meas¬ 
ures and high-health breeding farms, as well as 
selection for general disease resistance under 
commercial conditions (Merks, 2000; Bishop 
and MacKenzie, 2003). 

Derivation of economic weights 

Once the breeding goal traits have been cho¬ 
sen, the question arises of giving each trait its 
proper weight in an overall objective, also 
termed aggregate genotype. A common prac¬ 
tice is to use profit equations of the form: 

P (profit) = R (returns) - C (costs) (16.3) 

By taking partial derivatives of P with respect 
to the n traits included as goal traits in R and C, 
economic weights are obtained and an aggre¬ 
gate genotype (H) is established of the form: 

H = w^A^ + W 2 A 2 + ... + w n A n (16.4) 

where A 1 -A n are the breeding values for the 
traits included in the breeding goal and w 1 -w n 
are their respective economic weights. This has 
been the approach in the classic paper of Hazel 
(1943) on selection index theory. However, as 
pointed out by Moav (1973), the economic 
weight so derived depends on the perspective 
taken in defining P. This dilemma of different 
economic weights for different perspectives in 
production, under unchanged economic condi¬ 
tions, was resolved by Smith et al. (1986), who 
suggested imposing two conditions. The first 
is that fixed costs, incurred in running the 


production enterprise, should be included with 
other costs and expressed per unit of output. 
The second condition is that any extra profit 
from genetic change that could also be obtained 
by altering the size of the enterprise should not 
be counted. The application of these two condi¬ 
tions also shows that the economic weights are 
those obtained in considering cost per unit 
return, i.e. C/R instead of R - C. More complex 
economic models as possible alternatives were 
given by Amer et al. (1994). 

Economic weights can also be derived from 
a system of equations known as a bio-economic 
model, or profit function, which is a function of 
phenotypic traits and management variables 
contributing to revenue and/or costs (Tess et a/., 
1983), as well as from the model of De Vries 
(1989). The economic weight of a trait is calcu¬ 
lated from the change in predicted profit based 
on a single unit change in that trait, holding all 
other traits constant, or directly from the partial 
derivatives evaluated at the relevant population 
mean (Moav and Hill, 1966). A similar profit 
function approach was used by Quinton et al. 
(2006) to derive economic weights for sow pro¬ 
ductivity traits. The advantage of using the profit 
function approach is that economic weights can 
be recalculated for different production systems, 
market requirements or population trait levels 
with relative ease. For example, as litter size 
increases, the relative economic value of 
increases in litter size decreases but that of piglet 
survival increases (Knol and Mathur, 2009). In 
this way, the economic values can be different 
for different population means at the commer¬ 
cial levels, and the profit function approach 
allows for adjusting them accordingly. 

In some cases, genetic improvement is 
desired in traits that do not provide a direct eco¬ 
nomic return but are important for non-economic 
reasons. Such traits have non-economic values. 
For example, improvement in physical sound¬ 
ness leads to economic benefits such as reduc¬ 
tion in veterinary costs and better sow longevity, 
but there are additional non-economic benefits, 
such as those for pig welfare and ethics. In this 
case, the trait has an economic as well as a non¬ 
economic value. The relative importance of eco¬ 
nomic and non-economic values for different 
traits was described by Kanis et al. (2005), as 
given in Table 16.2. Non-economic values are 
also derived in monetary units and included in 
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Table 16.2. Example of the allocation of the relative economic and non-economic values of different traits. 
Source: Kanis et al. (2005). 


Trait/characteristic 

Economic 

Value 

Non-economic 

Explanation of non-economic value 

Growth rate 

++++ 

+ 

Traits reflect pig welfare and health, and are 

Feed efficiency 

++++ 

+ 

related to a possible manure surplus 

Meat percentage 

+++++ 


Mainly economically important 

Meat quality 

++ 

+++ 

Better quality is desired but not paid for 

Litter size 

+++++ 


Mainly economically important 

Sow longevity 

+ 

++++ 

Replacement costs of sows are relatively low, 
but citizens attach value to longevity 

Piglet vitality 

+++ 

++ 

Apart from the economic reasons, impaired 

Leg condition 

+++ 

++ 

animal health is a major concern for many 

Disease resistance 

+++ 

++ 

citizens and farmers 

Aggressiveness 

+ 

++++ 

Deviant behaviour does not cost much, but it is 

Stereotypic behaviour 


+++++ 

undesired by many citizens and farmers 


the selection index calculations. Their estimation 
is based on relative genetic progress in traits with 
economic and non-economic importance, as 
described for societally important traits by Kanis 
et al. (2005). 


Adaptation of breeding objectives 
to changing economic conditions 

Trends in breeding objectives have been dis¬ 
cussed in detail by Ollivier et al. (1990), Haley 
(1991) and Merks (2000). Past experience 
shows that continuous re-evaluation of the eco¬ 
nomic weights applied to each trait is needed. In 
spite of efforts made to foresee changes in eco¬ 
nomic conditions, this exercise can never be 
entirely satisfactory, because of inevitable dis¬ 
crepancies between present objectives and future 
production conditions. However, this should be 
of little concern as long as the evolution of these 
conditions is slow and gradual, because using 
slightly ‘false’ economic weights has a limited 
impact on overall selection efficiency in most 
cases (Vandepitte and Hazel, 1977). 


Choice of Selection Criteria and their 
Use in Breeding Value Estimation 

A distinction is classically made between traits 
that are considered as objectives for improve¬ 
ment, and traits that are actually used in ranking 


the male and female candidates, which are 
termed criteria of selection. In pig breeding, the 
two sets only partly coincide, and each set 
includes a fairly large number of traits. It is the 
purpose of performance recording programmes 
to define the measurements to be used as selec¬ 
tion criteria and to organize the collection and 
processing of the corresponding data. Such 
programmes will now be reviewed briefly, as 
more details can be found in several textbooks 
on pig breeding, e.g. in Sellier (1986) and 
Glodek (1992), among others. A historical pres¬ 
entation of performance recording and genetic 
evaluation since the beginning of the century 
has also been given by Sellier and Rothschild 
(1991) and Merks (2000). 

Reproduction performance has for a long 
time been assessed through on-farm litter¬ 
recording systems, including litter size at birth 
and at weaning, sometimes completed by litter 
weights, these traits being viewed as the most 
important reproduction traits (see Chapter 10 
for more details). Production traits, namely 
growth rate, feed efficiency and carcass meas¬ 
urements (see Chapter 14), were initially 
recorded almost exclusively in central testing 
stations, built on the Danish model, as progeny¬ 
testing stations became progressively available 
to pig breeders in most countries from the early 
1920s to the late 1950s. An important step 
has been the advent of techniques allowing a 
fairly accurate evaluation of body composition 
on the live pig, starting with the metal ruler of 
Hazel and Kline (1952), and followed by the 
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development of ultrasonic machines and more 
sophisticated technologies such as computer 
tomography (Kolstad, 2001). This has opened 
the way to central performance testing stations 
for young boars, replacing progeny-testing sta¬ 
tions, and also to on-farm testing programmes. 
Boar testing stations usually record average 
daily gain over a given live-weight interval, 
food conversion ratio over the same interval, 
and backfat thickness at the end of test, 
whereas on-farm tests generally include only 
daily gain (expressed as days to a given weight) 
and backfat thickness, lean depth and intra¬ 
muscular fat at a given weight measured 
through ultrasound. More recently, meat and 
fat quality traits have been introduced into test¬ 
ing programmes, based on measurements 
made in slaughterhouses, such as ultimate pH, 
colour and water-holding capacity. More details 
on the meat and carcass traits of interest can 
be found in Chapter 15, and especially on 
advances to be expected regarding live animal 
measurements based on tissue biopsies. 

As shown by Hazel in his classic paper of 
1943, pig improvement is essentially a multiple- 
trait selection problem, which can be solved by 
combining performance records into a linear 
index (I). This index is a predictor of breeding 
value for the overall breeding goal, defined so as 
to maximize the correlation between I and the 
aggregate genotype (H), described in the previ¬ 
ous section. This principle eventually became 
almost universally applied in pig breeding, use 
being made of estimates of the necessary genetic 
parameters (heritabilities and genetic correla¬ 
tions), as presented in other chapters of this 
book. The performance records entering the 
selection indices, collected either on farm or in 
central test stations, are generally expressed as 
deviations from contemporary group means. In 
the meantime, selection index theory was being 
refined through the development of mixed model 
methodologies, taking account of unequal infor¬ 
mation among candidates and unknown means, 
and providing best linear unbiased prediction 
(BLUP) of breeding values. Application of the 
theory then involved a two-step procedure, first 
the estimation of individual trait breeding values 
and, in a second step, the application of relative 
weights to those breeding values for deriving the 
estimation of H. As noted by Hazel et a/. (1994), 
such a procedure brings more flexibility in the 


adaptation of the economic weights to any 
breeding system, without the need to recalculate 
individual breeding values. 

The use of BLUP methodology in a national 
pig breeding programme was done for the first 
time in Canada in 1985 for growth and backfat 
on-farm genetic evaluations (Hudson and 
Kennedy, 1985), and was later extended to 
maternal traits. In the USA, a cooperative project 
between Purdue University and purebred asso¬ 
ciations, termed the Swine Testing and Genetic 
Evaluation System (STAGES), was also initiated 
in 1985, and reported its first evaluations in 
1986 (Stewart et a/., 1991). STAGES and the 
Canadian programme were designed as on-farm 
evaluation systems, with the added capability of 
across-herd analysis performed at longer time 
intervals. Progress in speed of data transmission 
and computing has now made it possible to 
shorten the computation intervals and to make 
centrally produced BLUP values available to 
breeding farms overnight. Such systems have 
been applied by several European countries, in 
most cases with the implementation of multivari¬ 
ate prediction-estimation software such as pest 
(Groeneveld et a/., 1990), asreml (Gilmour 
et a/., 2002) and pigblup (Hermesch and Crump, 
2006). Most breeding organizations also apply 
similar procedures. In addition, there has been 
increased recording of data, both in terms of 
number of traits and number of pigs recorded. In 
many cases, the data are now transferred over 
the Internet from the pig farms to the genetic 
evaluation centres, and the genetic evaluation 
procedures are increasingly automated to com¬ 
pute estimated breeding values (EBVs) using 
millions of records and release them in a 
few hours. Computer software and web 
applications have been made available for 
breeders to define their breeding goals, develop 
customized selection indices, predict expected 
genetic gains and identify selection candidates 
(e.g. Groeneveld, 2004). 

Design of Breeding Programmes 

Breeding for sire lines 

The full breeding objectives are typically met by 
a commercial crossbred pig produced by cross¬ 
ing different sire and dam lines. The contribution 
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of the sire line is through: (i) maximized AI per¬ 
formance (libido, semen quantity and semen 
quality); (ii) improved pre-, peri- and post-natal 
survival; (iii) improved production; and 
(iv) improved carcass and meat quality traits. 
Differences between lines are exploited to adapt 
to changing market conditions, such as differ¬ 
ences in robustness, muscularity and gain. 

Production traits in pigs are measurable 
on both sexes before breeding (growth rate, 
food conversion and backfat thickness), or after 
slaughter (lean content, lean and fat character¬ 
istics). Individual and sib information therefore 
provide the essential criteria for selection in 
most breeding programmes, as additional cull¬ 
ing on progeny performance has been known 
for a long time to be ineffective (Dickerson and 
Hazel, 1944; King, 1955). The expected 
annual response (R a ) in Eqn 16.2 is then maxi¬ 
mized when i/t = (i s + i d )/(t s + t d ), i.e. the annual 
selection intensity is at its maximum. With the 
common demographic parameters of the pig, 
such as first offspring at 1 year of age, six can¬ 
didates per litter at breeding age and a mating 
ratio (sow:boar) of 15 under conditions of natu¬ 
ral mating, it can be shown that the optimal t s 
and t d in individual selection are close to 1 year, 
which yields a maximum i/t ratio of 1.75 
(Ollivier, 1974). For a trait of medium heritabil- 
ity (h 2 ), h 2 = 0.30, a maximum annual response 
of nearly one genetic standard deviation (sd) 
can be obtained with such a method of selec¬ 
tion. When information on slaughtered sibs is 
used in selection, the value of i/t has to be 
adapted to the ensuing reduction in the number 


of candidates remaining and the change of sex 
ratio among them (Ollivier, 1988a). The loss in 
i/t incurred in any sib- or combined sib-individual- 
testing scheme can then be set against the 
increased accuracy of evaluation. 

It should be borne in mind that the 
responses predicted in such a theory refer to 
an idealized situation of a large population of 
sows, farrowing simultaneously at fixed inter¬ 
vals of 6 months, and that selection is assumed 
to be carried out among a large number of 
independent observations. In practice, how¬ 
ever, those assumptions are not fulfilled, as far- 
rowings generally occur quasi-continuously in 
breeding herds, and comparisons between 
candidates are made within ‘batches’ contain¬ 
ing a limited number of full-sib groups. The 
resulting reduction in selection intensity for the 
nested family structure typical of the pig was 
worked out by Meuwissen (1991). It can be 
shown that in a herd producing 100 gilt litters 
per year in 17 batches, the expected maximum 
of i/t in individual selection is reduced by 
approximately 15%, to a value of 1.5, with the 
same demographic parameters as above 
(Ollivier, 1988b). Even the latter objective will 
rarely be achieved in practice because of 
incomplete testing or of culling for reasons 
other than performances. Actual values of i/t 
achieved in individual on-farm tests rarely 
exceed one. Considerably lower values have 
been reported in national programmes empha¬ 
sizing either on-farm tests or central-station 
family selection (Table 16.3). It can be noted 
that the individual selection schemes simulated 


Table 16.3. Retrospective evaluation of selection intensities and generation intervals in breeding 
schemes for production traits. 


Breeding structure 

Selection criterion 

Selection intensity (/') 

Generation interval ( t ) 

i/t a 

Breeding company 

Individual 5 

1.28 

1.25 

1.02 

National programme 

Individual 0 

0.48 

1.92 

0.25 


Sib and progeny 5 

1.46 

2.00 

0.73 


Progeny e 

0.91 

2.00 

0.46 


a For i/t= 0.5, using an individual selection index, expected annual responses of 5 g in average daily gain and -0.4 mm in 
backfat thickness have been given by Sellier (1986, p. 201). The ranges of annual genetic trends reported by Sellier and 
Rothschild (1991) - 3 to 6g and -0.1 to -0.4mm, respectively, based on central testing records - indicate that the 
corresponding i/t would generally be below 0.5. 
b Bichard et al. (1986): 1966-1985; growth and backfat index. 

c Kennedy et al. (1986): 1977-1983; growth and backfat index in retrospect (Canadian Yorkshire and Landrace). 
d Christensen et al. (1986): 1980-1985; index on growth, lean and meat quality (Danish Landrace and Yorkshire). 
e Lundeheim etal. (1994): 1982-1986; growth and lean index in retrospect (Swedish Landrace and Yorkshire). 
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by De Roo (1988) and Belonsky and Kennedy 
(1988) do not fully exploit the potential of such 
schemes either, as the values of i/t realized are 
generally well below one. 

With the advent of BLUP (see the previ¬ 
ous section), records from all relatives, such as 
sibs, cousins and ancestors, can be used to pre¬ 
dict breeding values. Genetic response is then 
expected to increase, following the increases in 
average selection accuracies (p x and p 2 ) of 
Eqn 16.2, predicted from standard index selec¬ 
tion theory, and knowing that those increases 
are partially offset by the decreases expected in 
selection intensities, because of the high cor¬ 
relation between BLUP values of relatives. 
A further increase of response, however, more 
difficult to predict because it depends on the 
breeding structure, results from better estima¬ 
tion of the fixed effects, which allows across- 
farm (or station) evaluation, and also from the 
fact that it takes into account genetic trends 
(Sorensen, 1988). Another advantage of BLUP 
is to allow sequential culling, whereby sows and 
boars are culled on the basis of their estimated 
breeding values, with the best remaining long¬ 
est in the herd. The simulation of Belonsky and 
Kennedy (1988) shows that this culling scheme 
reduces length of generation intervals by 
16-22%, relative to strictly individual selection. 
This advantage actually depends on the herd 
replacement policy, which in that study implied 
culling sows after a maximum of five litters. 
Hagenbuch and Hill (1978) have shown that, 
with a more stringent policy of keeping sows 
only for a maximum of two litters, the gain 
from sequential culling is much reduced, to 
about 2-3%. Overall, the advantage of BLUP 
evaluation over individual selection is in a range 
of 10-30% for traits of high-to-moderate herit- 
ability, i.e. for most production traits, as shown 
by various simulations (Belonsky and Kennedy, 
1988; Sorensen, 1988; Rohe et a/., 1990). 

The gains from BLUP selection are to some 
extent counterbalanced in small populations by a 


higher increase in inbreeding than under indi¬ 
vidual selection, a tendency that is also enhanced 
by sequential culling, as shown by Belonsky and 
Kennedy (1988). Various methods to restrict 
inbreeding without significant loss of response 
were proposed by Toro and Perez-Enciso (1990). 
Another strategy, recommended by Brisbane 
and Gibson (1995), is to include genetic relation¬ 
ships in selection decisions, assuming a given 
value of a unit of inbreeding relative to a unit of 
genetic gain. Meuwissen (1997) developed algo¬ 
rithms for selection that maximize response to 
selection with a constraint on the rate of inbreed¬ 
ing by limiting the average genetic relationship 
among selected individuals. 

Breeding for dam lines 

As shown in Table 16.2, reproduction traits 
have to be included in the breeding objective in 
crossbreeding systems with specialized lines, 
apart from the breeds or lines that only serve 
to produce the terminal boars. However, until 
1995, little attention was paid to such traits in 
most breeding programmes. The situation has 
changed dramatically, as a combination of the¬ 
oretical (see Table 16.4) and experimental 
results (see Chapter 10) has shown that litter 
size can be successfully improved by selection, 
and also that economic conditions have made 
such selection increasingly worthwhile (Haley 
et al., 1988; Ollivier, 1988b). 

The efficiency of simultaneous selection 
for reproduction and production traits has been 
extensively investigated, either by using the 
index selection approach of Smith (1964) or 
by more elaborate methods based on the 
economic returns of the entire crossbreeding 
system (Elsen and Sellier, 1978), life-cycle 
economic efficiency (Smith et a/., 1983) or 
stochastic models in closed dam lines (De Vries 
et al., 1989). The general conclusion of those 
studies is that some benefit is expected from 


Table 16.4. Predicted annual selection response in litter size (number born per litter). 

Ollivier (1973) Avalos and Smith (1987) Toro et al. (1988) De Vries et al. (1989) 

Annual response 0.25 0.47 0.34 0.17 

Selection criterion Dam Dam and family Dam and family Dam and family 

Population size Large Large 100 sows 400 sows 
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the inclusion of the reproduction objective (HJ 
in addition to the production objective (H 2 ) in 
specialized dam lines. The benefit will increase 
with the relative importance (a) of H 1 in the 
overall breeding objective (as defined in 
Table 16.2), with the accuracy (rj of evaluating 
H 1 relative to H 2 (r 2 ), and with an increasingly 
unfavourable genetic correlation (r) between 
and H 2 . A useful approximation of the rela¬ 
tive benefit (RB) in terms of those parameters 
was proposed by W.G. Hill (in Webb and 
Bampton, 1987): 

RB = [(x 2 + xr + 0.5)° 5 + 0.5]/(x 2 + 2rx + 1) (16.5) 
in which x = ar-^/r^ 

In the past 15 years most, if not all, breed¬ 
ing organizations have adopted selection on lit¬ 
ter size (Rydhmer and Berglund, 2006). As the 
larger breeding organizations are competing 
breeding companies, exact estimates of genetic 
trends are difficult to obtain. Merks (2000) 
reported trends on the order of 0.20 piglets 
total born per year, while the breeding pro¬ 
gramme (Anonymous, 2009) reported 0.44 
piglets alive at day 5. Genetic trends appear 
therefore to be in agreement with the estimates 
of Table 16.4, with the Danish programme 
close to the theoretical maximum of Avalos and 
Smith (1987). Selection, however, is not for a 
single trait, and should include other fertility 
traits, such as survival, mothering ability and 
interval weaning to oestrus. The decreasing 
economic value of litter size is interesting. The 
concept of reduction in economic value of litter 
size with increase in sow productivity was fur¬ 
ther demonstrated by Quinton et al. (2006), 
using a profit function approach. It has been 
shown that the relative importance of litter size 
for the finished pig market decreased from 64% 
of the total breeding value, when the average 
litter size was eight pigs, to 29%, when the aver¬ 
age litter size was 20 pigs. At the same time, 
the economic value of perinatal survival 
increased from 17 to 42%, and that of survival 
to weaning increased from 7 to 18%. In this 
approach, using the Canadian data, it has been 
further shown that the relative importance of 
litter size for piglet weaning weight increased 
from 22 to 41% as the average litter size 
increased from eight to 20 pigs, whereas that 
of perinatal survival increased from 12 to 22% 
and that of survival to weaning increased from 


5 to 9%. These results further highlight the 
significance of piglet survival in increasing gains 
from start to finishing. 

Though most studies have so far concluded 
that litter size is genetically uncorrelated with 
growth and carcass traits (see the review of 
Haley et a/., 1988), there are indications that 
this might not be a general rule, as shown by the 
indirect responses observed in the selection 
experiment analysed by Kerr and Cameron 
(1994), or by recent estimates of genetic corre¬ 
lations between reproduction and production 
traits (see Chapter 10). Another approach to 
this correlation is the observation that the 
number of ovulations has increased more than 
litter size, creating a situation of uterine crowd¬ 
ing, with negative consequences for survival 
chances of the fetuses and possible negative 
consequences for the development of the sur¬ 
viving fetuses; this may lead to a reduction in 
muscle fibre development, lower birth weights 
and consequences for finishing gain and carcass 
and meat quality traits (Foxcroft et a/., 2004). 

Selection based on crossbred 

performance 

In addition to selection and performance 
recording within sire or dam lines, there are 
distinct advantages in recording data on cross¬ 
breds and selection on crossbred performance. 
The two main reasons are: (i) the breeding 
objective is based on the crossbred level, cross¬ 
bred data will show additive and non-additive 
expression of genes and pure-line parents can 
be selected that maximize performance at 
this level; and (ii) in smaller dam lines it adds 
records to improve the accuracy of BLUP 
estimates. 

Breeding programmes, therefore, need to 
combine purebred information with crossbred 
information. Predicted responses in a cross¬ 
breeding scheme implicitly assume a genetic 
correlation of 1 between purebred and cross¬ 
bred performance. The use of crossbred infor¬ 
mation, as in reciprocal recurrent selection 
schemes, offers some advantages compared 
with pure-line selection, especially for traits 
showing large non-additive genetic variation, 
as reviewed by Wei and van der Steen 
(1991). Two specific parameters are needed 
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to evaluate the efficiency of such a type of 
selection, namely the genetic correlation 
between purebred and crossbred performance 
(r pc ) and the crossbred heritability (h 2 c ). 
Purebred and crossbred information should in 
fact be combined in an optimal way in order to 
maximize genetic response in crossbreds, 
which can be done within the general frame¬ 
work of index selection theory (Wei and van 
der Werf, 1994). For instance, the crossbred 
paternal half-sib family mean may be com¬ 
bined in such a way with the purebred infor¬ 
mation on paternal half sibs, full sibs and own 
performance. As emphasized by Wei and van 
der Werf (1994), such a combined selection is 
always superior to pure-line selection when 
the testing of crossbreds is not at the expense 
of testing purebreds as, for instance, when a 
crossbreeding structure exists through which 
crossbred information is collected for manage¬ 
ment purposes. In such situations that apply to 
pig breeding (see for instance Fig. 16.2), 
crossbred information may indeed come as an 
addition to purebred data, and the increase 
that can be expected in genetic response will 
then depend on a proper evaluation of the two 
genetic parameters, r pc and h 2 c . 

Bijma et al. (2001) added rate of inbreed¬ 
ing to this approach and analysed short-time 
response and long-term genetic contributions 
using a deterministic approach. For their set of 
parameters, combining purebred and cross¬ 
bred performance yielded a better result than 
the use of purebred information only for the 
long term. 

Except for the theoretical approach as 
outlined here, there are practical and quantita¬ 
tive advantages of using crossbred data and/or 
data from different environments. Automated 
data recording has increased dramatically over 
the past years. Large volumes of fertility data 
and slaughter-line data are readily at hand; the 
advantages are that the performance traits are 
as close to the commercial reality as possible 
and that the traits are observed under cross¬ 
bred and field conditions. The main limiting 
factor is the availability of accurate pedigree 
information for this type of animal. Investment 
of breeding organizations in proper recording 
of sire and dam of a crossbred animal yields 
access to these crossbred field data; DNA tech¬ 
nology can help here. 


Breeding for specific environments 

The cost of running an effective breeding pro¬ 
gramme dictates the necessity of a large sales 
volume and with that a global approach. The 
pork value chain shows the worldwide interest 
in lowering the cost price of meat, but it also 
shows regional differences in production envi¬ 
ronment, products and societal constraints. 
The net result is that the breeding goal should 
be defined at the crossbred level under local 
environment, whether that is large-scale pro¬ 
duction in the Americas or family farms in 
north-western Europe. The result might be line 
differentiation for different markets, or robust 
and flexible lines that have the ability to adapt 
to differing environments. 

In the generalized situation, a trait is 
defined per environment. If the genetic corre¬ 
lation between two production environments is 
lower than 0.4-0.6, then separate lines have 
to be developed (Mulder and Bijma, 2006). 
Bloemhof et al. (2008) showed differences 
between sow lines in reaction to environmental 
temperature at time of insemination. One sow 
line loses farrowing rate and litter size at tem¬ 
peratures above the upper critical level of 
20°C, while the other does not. The authors 
suggest that genetic variation for the slope 
above the upper critical temperature may exist. 
This approach can be extended towards the 
general adaptive ability of the animal; this abil¬ 
ity can then be estimated and added to the 
selection index using, e.g. reaction norm mod¬ 
els (Knap and Su, 2008). 

Transfer of genetic gains in commercial 

populations 

Genetic gain starts with the identification of 
single superior animals and should result in the 
presence of the superior alleles that are present 
in these animals in as many crossbreds as pos¬ 
sible. The female reproductive step is quite 
slow. One sow can produce around nine pro¬ 
ductive female offspring per year. Assuming a 
replacement rate of 40%, one pure-line nucleus 
sow can maintain 22.5 pure-line multiplication 
sows, which use the semen of a second dam 
line to produce 202.5 crossbred gilts. Assuming 
the same 40% replacement, a sow herd of 500 
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commercial sows can be maintained to wean 
15,000 finishers, assuming 30 piglets per sow 
per year. When F 2 production is applied, that 
is, the commercial sow is a three-way cross or 
a backcross or a rotational cross, one purebred 
sow can yield her genes to well over 300,000 
finishers. 

Let us assume the optimal situation for 
pork production to be one purebred sire line 
and a two-way cross on the dam side. This 
structure could be adapted by creating a cross¬ 
bred terminal sire, with improved male repro¬ 
ductive performance (better libido, better 
semen quality) because of heterosis advantage. 
The production traits, which should be trans¬ 
ferred to the offspring, will not benefit from the 
crossbreeding advantage. The negative side, 
the cost of an extra generation, will, in most 
situations, not be offset by the fertility advan¬ 
tage. A second argument in favour of using a 
crossbred boar could be that the traits of two 
existing sire lines are on both sides of a market 
optimum, e.g. one sire line is too muscled with¬ 
out enough gain and the other sire line is the 
reverse. A crossbred boar will then additively 
average the two sire lines. 

Because breeding programmes are more 
and more international, the relevance of health 
status increases. Pure-line production tends to 
take place at high-health farms to facilitate dis¬ 
semination of genetic trend. 


Breeding for particular objectives 

‘Licence to produce’ is increasing in relevance. 
Society starts to question the direction in which 
animal breeding changes the pig populations. 
Neeteson et al. (1999) discuss this field across 
species, and Kanis et al. (2005) discuss, for 
pigs, the following areas for inclusion of soci¬ 
etal traits in the breeding objective: pig welfare 
and health, ecological effects and natural 
resources, and healthiness and the sensory 
quality of pork. 

These society concerns lead to traits on 
animal integrity, e.g. castration, tail docking and 
teeth clipping. Non-castration may potentially 
lead to the presence of boar taint in entire 
males; the underlying substances of boar taint 
are androstenone, a pheromone, and skatole, a 
degradation product of tryptophan. Genetic 


selection against boar taint to completely elimi¬ 
nate the need for castration is possible (Merks 
et al., 2009). This would also take advantage of 
better feed efficiency from entire males. This 
should be done carefully to overcome the poten¬ 
tial risks, such as: (i) that it will influence the hor¬ 
monal balance of the animal, causing male and 
female reproduction reductions; (ii) that a pro¬ 
duction change towards non-castration will dis¬ 
rupt the economic value system, because entire 
males are far leaner than castrated animals and, 
therefore, production animals might suddenly 
be too lean for the average market; and (iii) that 
male behaviour (aggression and mounting) will 
disrupt the finishing phase. The latter problems 
are also the drivers behind tail docking and teeth 
clipping issues. 

Theory on social behaviour was first devel¬ 
oped by Griffing (1967). The principle is that 
the phenotypes of pen mates influence the 
expression of the trait of the selection 
candidate. 


P\ ~ A Dj + L D . 


(16.6) 


n 


n 


(16.7) 


Pj = A Dj + E Di +X A ,. + X.E ■ 

i u, i u, i j:;t j s’j sj 

Equation 16.6 is the classic expression, where P 
denotes the phenotype of selection candidate i, 
A D j is the direct genetic effect of i and E D i is the 
environmental effect for animal i. Equation 16.7 
extends this approach towards the social influ¬ 
ence of group members by including A Sj as the 
social genetic effect of group member j and E S j 
as the social environmental effect of group mem¬ 
ber j. The total genetic merit of an animal is then 
the sum of its additive merit for the trait plus 
(n - 1) times the genetic influence on its group 
members, where n equals average pen size 
(Bijma et al., 2007). First genetic parameter esti¬ 
mates show considerable genetic variation in 
social behaviour effects for the traits gain and 
feed intake (Bergsma et al., 2008). Parameter 
estimates can be found in Chapter 10. An open 
question is whether the estimated social effects 
link with behaviour, as encountered in tail biting 
or male sexual behaviour. Estimation of genetic 
parameters requires large well-defined data sets 
with genetically mixed pens. Animal behaviour 
will become more and more relevant, because 
labour spent per animal decreases for economic 
reasons, and group sizes increase because of 
societal concerns about animal welfare. 
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Another important trait is uniformity. 
Uniformity at slaughter is: (i) relevant to reduce 
the number of pulls and, with that, the average 
usage time of finishing rooms and uniformity in 
birth weight; and (ii) valued because it reduces 
the percentage of small piglets, which have an 
increased risk of mortality. The latter example 
can be seen as a trait of the sow and, with that, 
as the expression of the genes of the sow. The 
first example is more complicated: how to 
reduce variation in a group with selection on the 
genotype of individuals. The classical evolution¬ 
ary answer, natural selection, is stabilizing selec¬ 
tion, which removes some undesirable alleles 
and increases homeostasis of the developmental 
process. A possible way out for human-directed 
breeding programmes is the assumption of 
genetic heterogeneity of environmental vari¬ 
ance or, in other words, to assume that part of 
the environmental factor in the normal genetic 
evaluation (Eqn 16.4) is heritable and identifies 
the ability of the animal to cope with its environ¬ 
ment (Mulder et a/., 2007). Implementation in 
the breeding programme requires an economic 
value. For variance (or uniformity), this eco¬ 
nomic value is much higher if the trait shows an 
economic optimum (Mulder et a/., 2008). 

Other examples of particular objectives 
are the elimination of genetic abnormalities, 
such as the existence of chromosomal aberra¬ 
tions responsible for drastic reductions in litter 
size, as in the case of reciprocal translocations, 
which makes it worthwhile to screen the boars 
of paternal and maternal lines for karyotype 
abnormalities. This screening is systematically 
done in France, yielding 0.44% of the animals 
as affected (Ducos et a/., 2007). 

The development of statistical tools allows 
the application of more complex genetic mod¬ 
els and helps to discover genetic variation for 
traits such as behaviour and uniformity with 
large indirect influence on the breeding goal. 

Marker-assisted Breeding 

Limitations of phenotype-based breeding 

programmes 

The genetic improvement programmes 
described in the previous sections are driven by 


measuring phenotypes for traits of interest on 
selection candidates in the nucleus or on close 
relatives of the selection candidates. These 
phenotypes are then used to estimate the 
breeding values of selection candidates for 
traits and incorporate these in a multi-trait 
selection index to identify individuals that best 
meet the breeding objective. Although this has 
led to impressive increases in performance for 
several traits, the phenotype-based approach 
to genetic improvement suffers from several 
important limitations, including: 

• Several traits have low heritability, e.g. 
reproduction, disease resistance and sur¬ 
vival traits. This limits the accuracy and 
efficiency of selection and, therefore, 
genetic improvement for these traits. 

• Several traits can only be measured later in 
an animal’s life, e.g. sow lifetime productiv¬ 
ity. This either increases generation intervals 
if the choice is made to wait until the 
phenotype can be observed on the selection 
candidates, or reduces accuracy of selection 
if the choice is made to make selection 
decisions before the phenotype is 
observed. 

• Several traits cannot be measured directly 
on selection candidates. For example, many 
meat quality traits require slaughter of the 
animal. Thus, for these traits, genetic 
evaluation is based on phenotypes of 
relatives that are slaughtered, limiting the 
accuracy of selection as well as the selection 
intensity, as individuals that may be potential 
selection candidates are slaughtered. 
Another example is disease resistance, 
which can often not be measured directly 
on selection candidates because they must 
be kept under high-health environments 
that are free of most diseases that affect 
pigs in the production environment. This 
category also includes the problem of the 
genotype-by-environment interactions that 
exist between the high-health purebred 
nucleus environment and the crossbred 
production environment for many 
performance traits. As explained previously, 
this can be overcome by collecting 
phenotype on crossbred progeny in the 
production environment, but this is 
expensive and leads to limited accuracies of 
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selection, higher rates of inbreeding and/or 
longer generation intervals. 

• Some traits are expensive to measure (e.g. 
disease resistance). Phenotype-based 
programmes require routine collection of 
large numbers of phenotypes on the 
selection candidates themselves or their 
close relatives, resulting in costs that may 
not outweigh the benefits. 

• The increased emphasis that is placed on 
relatives in many of the above situations 
also leads to greater rates of inbreeding 
within the selection lines. The higher rates 
of inbreeding result from family members 
having very similar estimates of breeding 
values if these EBVs are heavily based on 
phenotypes of parents, full sibs and half 
sibs rather than on own performance or 
progeny performance. Thus, selection 
tends to be of families rather than individuals 
within a family, resulting in selected 
individuals being more closely related and 
higher rates of inbreeding. 

• Whereas the main focus in pig breeding 
programmes has been on the genetic 
improvement of additive genetics through 
selection on EBV, most traits of interest 
exhibit non-additive effects, e.g. through 
heterosis when crossing breeds. Traditional 
quantitative genetics, however, offers lim¬ 
ited opportunities to select for non-additive 
effects. 

With the implementation of molecular 
genetics in the 1980s and 1990s, in particular 
the discovery of new classes of DNA polymorph¬ 
isms, prospects to overcome these limitations 
of phenotype-based selection emerged with the 
potential to directly select on an individual’s 
genotype for genes or genetic markers that are 
associated with the trait. Progress in developing 
genetic markers and using these to detect 
regions of the genome associated with traits of 
interest (quantitative trait loci (QTLs)) has been 
described in the various chapters dealing with 
growth, meat quality and reproduction. The 
focus of this section will be on how such genetic 
tests can be used for genetic improvement and 
not on specific markers or genes. The use of 
genetic markers to enhance genetic progress in 
the pig has also been discussed by Visscher and 
Haley (1995). A recent review of the use of 


markers to map and use genes for complex 
traits in livestock is in Goddard and Hayes 

(2009). 

Principles of the use of genetic 
markers for genetic improvement 

The use of marker-assisted selection (MAS) 
requires knowledge of genes or markers that 
are associated with the traits of interest and 
quantitative estimates of these associations in 
the population of interest. The principle behind 
finding such markers is to compare the aver¬ 
age trait phenotype of animals that have, e.g. 
genotype AA at a specific marker to the aver¬ 
age phenotype of animals that have genotype 
AG or GG. 

Three types of genetic markers 
for within-breed genetic improvement 

For the purposes of the use of genetic markers 
for selection, three types of genetic tests can 
be distinguished (Dekkers, 2004): 

1. Direct markers: genotype for loci that code 
for the functional mutation that affects the trait. 

2. LD (linkage disequilibrium) markers: geno¬ 
type for loci that are in population-wide linkage 
disequilibrium with the functional mutation. 

3. LE (linkage equilibrium) markers: genotype 
for loci that are in population-wide linkage 
equilibrium with the functional mutation in out- 
bred populations and need to be used on a 
within-family basis. 

Methods for detecting these types of loci 
were described in Chapters 5 and 7. LE mark¬ 
ers can be readily detected on a genome-wide 
basis by using breed crosses or the analysis of 
large half-sib families within a breed. Such 
genome scans require only sparse marker 
maps (150-300 markers across the genome, 
depending on marker informativeness and 
genotyping costs) to detect most QTLs of mod- 
erate-to-large effects. Many examples of suc¬ 
cessful applications of this methodology for 
detection of QTL regions are available in the 
literature (see Andersson, 2001). The LD 
markers are by necessity close to the functional 
mutation for sufficient population-wide LD 
between the marker and QTL to exist (typically 
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within 1 cM, depending on the extent of LD in 
the population, which depends on population 
structure and history). LD markers can be iden¬ 
tified using candidate gene (Rothschild and 
Soller, 1997) and fine-mapping approaches 
(Andersson, 2001; Georges, 2007). Recently, 
high-density single nuclear polymorphism 
(SNF) panels have become available, which 
allow the entire genome to be scanned for 
genetic markers that are associated with the 
trait based on population-wide LD with QTLs. 
Direct markers, i.e. polymorphisms that code 
for the functional mutations, are most difficult 
to detect because causality is difficult to prove 
and, as a result, a limited number of examples 
are available and most are for genes associated 
with genetic defects, rather than with a quanti¬ 
tative trait. 

Direct markers are preferred for use in 
genetic improvement because they directly 
evaluate genotype at the locus that affects the 
trait. The next preferred type of marker with 
regard to use in genetic improvement is LD 
markers, as they allow selection on genotype 
across the population because of the consist¬ 
ent association between genotype and pheno¬ 
type. In contrast, use of LE markers must allow 
for different linkage phases between markers 
and QTLs from family to family, and effects 
must be estimated on a within-family basis. 
Effects of direct and LD markers can be esti¬ 
mated on a random sample of the population, 
in principle without the need for pedigree. 
Because of the complications associated with 
the use of LE markers, they have seen limited 
application in livestock breeding programmes 


(Dekkers, 2004), and the emphasis is on devel¬ 
oping direct and LD markers. 

Methods to incorporate marker data into 
genetic evaluation procedures were summa¬ 
rized by Dekkers and van der Werf (2007). 
They depend on the type of marker used (direct 
and LD versus LE markers), and whether a lim¬ 
ited number or many markers are included, as 
in genomic selection (Meuwissen et al., 2001). 
These methods result in marker-based EBVs 
that estimate the breeding value of an individ¬ 
ual for QTLs that are associated with markers, 
and a ‘residual’ or polygenic breeding value 
that captures the effects of all other QTLs, 
based on phenotype. A simplified example 
with genotypes for three markers is presented 
in Table 16.5. 

Once markers associated with the trait 
have been identified and validated, they can be 
used for genetic improvement. In the follow¬ 
ing, we first describe the use of marker infor¬ 
mation for enhancing the process of integrating 
superior qualities of different breeds, and then 
discuss the use of markers to enhance within- 
breed selection. 

Exploiting between-breed variation 

Crossing breeds results in extensive LD, which 
can be capitalized on using MAS in a number 
of ways. If a large proportion of breed differ¬ 
ences in the trait(s) of interest are due to a small 
number of genes, gene introgression strategies 
can be used. If a larger number of genes are 
involved, MAS within a synthetic line is the 
preferred method of improvement. 


Table 16.5. Example of the calculation of a marker-based EBV (estimated breeding value) and index of 
phenotype (deviated from contemporary mean) and marker-based EBV. a 


Pig 

Marker 1 

Marker 2 

Marker 3 

Marker-based 

EBV 

Own 

phenotype 

Index 

value 

Genotype 

Value 

Genotype 

Value 

Genotype 

Value 

1 

AA 

+10 

AA 

+5 

AA 

-10 

5 

+35 

+15.0 

2 

AA 

+10 

AA 

+5 

BB 

+10 

25 

-10 

+13.3 

3 

AB 

0 

BB 

-5 

AB 

0 

-5 

-15 

-8.3 

4 

AB 

0 

BB 

-5 

AA 

-10 

-15 

+15 

-5.0 

5 

BB 

-10 

AA 

+5 

AB 

0 

-5 

+25 

+5.0 


a Marker-based EBVs are based on three markers with allele substitution effects (allele A versus B) of +10, +5 and -10. 
The markers jointly explain 50% of the genetic variance for a trait with heritability {If) 0.5. Resulting index weights on 
molecular score and phenotype are 2 / 3 and V 3 , respectively, based on Lande and Thompson (1990). 
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marker-assisted introgression. Introgression 
of the desirable allele at a target gene from a 
donor breed to a recipient breed is accomplished 
by multiple backcrosses to the recipient breed, 
followed by one or more generations of 
intercrossing. The aim of the backcross 
generations is to generate individuals that carry 
one copy of the donor QTL allele, but that are 
similar to the recipient breed for the rest of the 
genome. The aim of the intercrossing phase is to 
fix the donor allele at the QTL. Marker 
information can enhance the effectiveness of the 
backcrossing phase of gene-introgression 
strategies: (i) by identifying carriers of the target 
gene(s) (foreground selection); and (ii) by 
enhancing recovery of the recipient genetic 
background (background selection). Effectiveness 
of the intercrossing phase can also be enhanced 
through foreground selection on the target 
gene(s). If the target gene cannot be directly 
genotyped, carrier individuals can be identified 
based on markers that flank the QTL at <10 cM, 
because of the extensive LD that exists in crosses. 
The markers must have breed-specific alleles, 
such that line origin can be identified. For the 
introgression of multiple target genes, gene 
pyramiding strategies can be used during the 
backcrossing phase to reduce the number of 
individuals required (Hospital and Charcosset, 
1997; Koudande et al., 2000). For background 
selection, markers are used that are spread over 
the genome at <20 cM intervals, such that most 
genes that affect the trait will be within 10 cM of 
a marker. Combining foreground and background 
selection, selection will be for the donor breed 
segment around the target locus, but for recipient 
breed segments in the rest of the genome. 
Foreground selection will result in selection for 
not only the target locus, but also for donor 
breed loci that are linked to this locus, some of 
which could have an unfavourable effect on 
performance. To reduce this so-called linkage 
drag around the target locus, greater emphasis 
can be given to markers that are in the 
neighbourhood of the target locus during 
background selection (apart from the flanking 
markers, which are used in foreground 
selection). 

Most studies have considered marker- 
assisted introgression of a single QTL (e.g. 
Hospital and Charcosset, 1997), but often sev¬ 
eral QTLs must be introgressed simultaneously. 


Koudande et al. (2000) showed that large pop¬ 
ulation sizes are needed to obtain sufficient 
individuals that are heterozygous for all QTLs 
in the backcrossing phase. This would make 
marker-assisted introgression not feasible in 
livestock breeding programmes. In many cases, 
however, immediate fixation of introgressed 
QTL alleles may not be required. Instead, the 
objective of the backcrossing phase can be 
to enrich the recipient breed with the favoura¬ 
ble donor QTL alleles at high enough frequency 
such that they can be selected on following 
backcrossing. The effectiveness of such strate¬ 
gies was demonstrated by Piyasatian et al. 
(2008). 

MARKER-ASSISTED IMPROVEMENT OF SYNTHETIC 

lines. With introgression, the aim is to 
recover the recipient breed genotype, except 
for the donor QTL, because of the superior 
general performance of the recipient breed. An 
alternative objective could be to simply aim for 
individuals with highest merit. Selection would 
then be for marker genotype as well as EBV, 
estimated across breeds or lines. This EBV 
selection would replace background selection, 
as recovery of the recipient genotype is achieved 
through selection on genetic merit rather than 
through selecting for breed of origin. This 
strategy would be more competitive if the 
original breeds overlap in merit, and, indeed, 
background selection based on anonymous 
markers would be less profitable, as was shown 
by Dominik et al. (2006). 

Strategies for the use of markers to select 
within a hybrid population were first proposed 
by Lande and Thompson (1990). The strategy 
capitalizes on population-wide LD that initially 
exists in crosses between lines or breeds. Thus, 
marker-QTL associations identified in the F 2 
generation can be selected on for several gener¬ 
ations, until the QTLs or markers are fixed or the 
disequilibrium disappears. Zhang and Smith 
(1992) evaluated the use of markers in such 
a situation with selection on BLUP EBV. 
Piyasatian et al. (2007) showed that the use of 
genomic selection procedures with marker 
effects fitted as random effects resulted in 
improved response and performance. Although 
all these studies considered the ideal situation of 
a cross with inbred lines, there will be opportuni¬ 
ties to utilize a limited number of markers to 



410 


J.C.M. Dekkers et al. 


select for favourable QTL regions that are 
detected in crosses between breeds, thereby 
enhancing the development of superior 
synthetics. 

Exploiting within-breed variation 

Most genetic improvement in pigs is based on 
selection within pure breeds or lines. Within- 
breed selection can be improved with availabil¬ 
ity of genotype on markers that are associated 
with the trait(s) of interest. The benefit of and 
strategy for the use of marker data for within- 
breed selection depend on the proportion of 
genetic variance that is explained by the genetic 
markers and the accuracy with which the true 
association of a genetic marker with the trait is 
estimated. Variance explained by genetic mark¬ 
ers depends on the proportion of QTLs that the 
genetic markers are associated with and on 
how tight the association is between marker 
genotype and QTL genotype, as quantified by 
the extent of LD for LD markers. The accuracy 
of the estimate of the marker-trait association 
depends primarily on the number of individuals 
with phenotype and marker genotype that are 
available to estimate the marker effects, along 
with the genetic architecture of the trait (herit- 
ability, number and effects of QTLs) and the 
statistical method used for estimation. Generally, 
data sets greater than 500 and into the thou¬ 
sands are needed to obtain sufficiently accurate 
estimates of LD-marker effects. As described 
previously, estimates of marker effects can be 
used to develop a marker-based EBV for selec¬ 
tion candidates. 

Unless the accuracy of the marker-based 
EBV is high, it usually is advantageous to com¬ 
bine selection on the marker data with selec¬ 
tion on phenotype or phenotype-based BLUP 
EBV, because the latter captures the collective 
effect of all QTLs, including those not captured 
by markers. In general, four strategies can be 
distinguished for the use of the marker data in 
selection. 

1. Selection on marker genotype or marker- 
based EBV alone. 

2 . Tandem selection, with selection of candi¬ 
dates on marker genotype or marker-based 
EBV, followed by selection on phenotype or 
standard phenotype-based BLUP EBV. 


3 . Simultaneous selection on a combination of 
marker data and phenotype or EBV (index 
selection). 

4 . Preselection on marker data at a young 
age, followed by selection on phenotype or 
EBV at a later age. 

Selection on marker information alone (strategy 
1) ignores information that is available on QTLs 
that are not captured by markers, and is expected 
to result in the lowest response to selection, 
unless most QTLs that affect the trait are included 
in the marker-based EBV, as is anticipated for 
genomic selection (see later). This strategy, how¬ 
ever, does not require additional phenotypes, 
other than those that are needed to estimate 
marker effects, and can be attractive when phe¬ 
notype is difficult or expensive to record (e.g. 
disease traits, meat quality, crossbred perform¬ 
ance in the field). Tandem selection (strategy 2) 
will also not maximize response to selection, 
because selection on the marker data may elimi¬ 
nate individuals that have superior genotypes for 
QTLs that are not captured by the marker-based 
EBV. To prevent this, Smith (1967) and Lande 
and Thompson (1990) suggested combining the 
two sources of information in a selection index 
(strategy 3). Lande and Thompson (1990) 
showed how selection index methods could be 
used to optimize the weights to apply to the 
marker data and phenotype, depending on the 
proportion of genetic variance explained by 
the markers and the heritability of the trait, and 
the increase in accuracy of selection that could 
be expected. The resulting index provides the 
best estimate of breeding value based on marker 
data and phenotype. Extensions of selection 
index methods to include marker data to 
multiple-trait selection are in Lande and 
Thompson (1990) and Weller (2009). Dekkers 
(2007a) showed how selection on a marker- 
based EBV could be incorporated into standard 
selection index software such as SelAction 
(Rutten et al ., 2002) by including the marker- 
based EBV as a correlated trait with h 1 2 = 1 and 
correlation equal to the proportion of genetic 
variance explained by the marker-based EBV. 
Methods that directly incorporate genetic mark¬ 
ers in routine mixed-animal model BLUP genetic 
evaluation (e.g. Fernando and Grossman, 1989) 
also result in EBVs that optimally combine 
marker and phenotypic information. 
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Although index selection is expected to 
result in greater response than tandem selection, 
the choice between tandem and index selection 
(and other alternatives) also depends on other 
factors, such as market and cost considerations. 
For example, rapid fixation of the targeted gene 
(e.g. by tandem selection) will reduce the costs of 
genotyping over generations, and may be desir¬ 
able from a marketing perspective. This can, 
however, also be achieved by increasing the 
weight on the molecular score in an index. 

Tandem and index selection apply to the 
use of marker information within a given stage 
of selection. If selection is over multiple stages, 
selection on molecular score could be empha¬ 
sized at an early age when limited or no pheno¬ 
typic information is available to distinguish 
selection candidates (strategy 4). A prime 
example is in dairy cattle, with preselection 
among full-sib dairy bulls for entry into progeny 
testing programmes (Kashi et a/., 1990; 
MacKinnon and Georges, 1998). Similar 
approaches could also be used in pig breeding 
when deciding which boars to put on, e.g. feed 
intake recording programmes. 

The potential benefits of MAS in pig 
breeding programmes were evaluated by simu¬ 
lation by Meuwissen and Goddard (1996). 
Using LE markers, they demonstrated that 
MAS is mainly useful for traits where pheno¬ 
typic measurement is less valuable because of: 
(i) low heritability; (ii) sex-limited expression; 

(iii) availability only after sexual maturity; and 

(iv) necessity to sacrifice the animal (e.g. slaugh¬ 
ter traits). Benefits were greatest for the latter 
category of traits. To evaluate the benefit of 
MAS for meat quality traits, Meuwissen and 
Goddard (1996) considered two implementa¬ 
tion strategies: 

1. A random two of four members of each full- 
sib family are slaughtered to record meat quality 
data. The remaining individuals are selected on 
the basis of a marker-assisted EBV for meat 
quality, once data on their sibs are recorded. 

2 . Animals are selected on the basis a marker- 
assisted EBV, and non-selected animals are 
slaughtered to provide data for the next gen¬ 
eration of selection. 

Comparisons of these two strategies were 
made to a conventional selection based on 
strategy 1 - but without the availability of 


genetic markers. Strategy 1 gave 24% 
greater response than conventional selection. 
The benefit of strategy 2 was substantially 
greater, but declined over generations as 
favourable alleles at the QTLs were fixed. The 
greater response from strategy 2 than 1 
was in large part the result of the greater 
selection intensity that was achieved with 
strategy 2 because half of the selection 
candidates were not slaughtered before 
selection. 

Marker data can also be used to address 
the problem associated with the genetic 
improvement of crossbred performance in 
commercial environments based on pure-line 
selection in nucleus environments by estimat¬ 
ing effects on phenotypes and genotypes col¬ 
lected on crossbred individuals in the field, and 
using the resulting estimates to estimate breed¬ 
ing values of individuals in the purebred nucleus 
populations based on their marker genotypes, 
as illustrated in Fig. 16.4 (Dekkers, 2007b). By 
analysing data from crossbreds in the field, esti¬ 
mates of marker effects will be for perform¬ 
ance of crossbreds under field conditions, 
rather than for purebred performance in 
nucleus herds. When using LD markers, esti¬ 
mation of effects can in principle be on an 
unpedigreed sample of individuals. This has 
the potential to substantially increase accuracy 
of selection for commercial crossbred perform¬ 
ance, without the increases in generation inter¬ 
vals or rates of inbreeding that are associated 
with phenotype-based combined crossbred/ 
purebred selection (Dekkers, 2007b). 

Commercial application of MAS requires 
careful consideration of economic aspects and 
business risks. Economic analysis of MAS 
requires a comprehensive approach that aims 
to evaluate the economic feasibility and opti¬ 
mal implementation of MAS. An excellent 
example of such an analysis is in Hayes and 
Goddard (2003), who conducted a compre¬ 
hensive economic analysis of the implementa¬ 
tion of LE markers in the nucleus breeding 
programme of an integrated pig production 
enterprise. Detection of QTLs and MAS on 
identified QTL regions for a multi-trait breed¬ 
ing goal and associated genotyping costs and 
extra returns from the production phase of the 
integrated enterprise were considered in the 
economic assessment. Hayes and Goddard 
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concluded that the implementation of MAS 
using LE markers was feasible for the assumed 
cost and price parameters. They also found 
that, in particular, if QTL detection was based 
on small sample sizes, stringent thresholds 
should be set during the QTL detection phase 
such that genotyping costs during the imple¬ 
mentation phase are reduced and selection of 
false positives is minimized. In a related study, 
Hayes and Goddard (2004) evaluated the 
break-even cost of using LD markers devel¬ 
oped using the candidate gene approach. 

History of the development of the use 
of genetic markers in pig breeding 

The assistance that can by provided by genetic 
markers was already illustrated in the 1980s by 
the use of biochemical markers closely linked 
to the halothane gene in order to eliminate the 
susceptibility allele from maternal lines (Mathur 
and Liu, 2003). The objective was to obtain a 
halothane-negative slaughter generation, pro¬ 
tected from the deleterious effects of the reces¬ 
sive susceptibility gene on liveability and 
porcine stress syndrome (FSS)-related meat 
characteristics. This example enters a first cat¬ 
egory of marker-assisted breeding, whereby 
gene frequency at a locus of interest or with 
large deleterious effects can be acted upon 
more efficiently than through a direct approach 
based on the phenotypic effects of the gene 
itself. With the advent of molecular genetics in 
the 1980s, DNA markers have become the 
tool of choice to develop indirect tests that can 
be used to advance selection. Using molecular 
genetics, a large number of markers and genes 
associated with traits of interest have been 
identified in the pig. These are catalogued and 
summarized in a web-based database called 
PigQTLdb (Hu et a/., 2005). As of March 
2010, this database contains reports on 5621 
QTLs on 546 different traits from 237 publica¬ 
tions. A sizeable number of the identified mark¬ 
ers and QTL have shown evidence of 
non-additive effects, including dominance, 
epistasis and genomic imprinting. The latter 
refers to parent-specific expression of genes in 
the progeny, which is a well-known phenome¬ 
non in mammals. Using a genome scan in a 
breed cross, de Koning et al. (2000) identified 


many imprinted QTLs in the pig, and several 
subsequent studies have confirmed these 
results. Jeon et al. (1999) and Nezer et al. 
(1999) found paternal expression for muscu¬ 
larity in the insulin-like growth factor 2 gene 
(IGF2) region of chromosome 2 (SSC2) in 
pigs. The causative mutation for this effect has 
now been identified and its imprinted effect 
confirmed (Jungerius et a/., 2004). 

Despite this wealth of information on 
genetic markers and QTLs associated with 
traits of interest, the application of genetic 
markers in commercial pig breeding has been 
limited. Potential reasons include: (i) only a lim¬ 
ited number of validated genetic markers have 
been available, and the cost of genotyping has 
been relatively high; (ii) the available genetic 
markers explain only a limited amount of 
genetic variation for the trait; (iii) marker or 
QTL effects were estimated on a within-family 
basis or in experimental crosses, which made it 
more difficult to incorporate them into breed¬ 
ing programmes; (iv) marker-trait associations 
did not replicate or were inconsistent across 
populations; and (v) QTLs were detected in 
crosses with wild boar or Chinese breeds and 
the favourable allele was already fixed in com¬ 
mercial breeds. 

Table 16.6 provides a list of examples of 
genetic markers that have been available and/or 
used in pig breeding. It is interesting to note that 
most of these markers are LD markers that were 
identified using the candidate approach. In some 
cases, the genetic test is for the causative muta¬ 
tion. In addition to publicly available markers 
and genetic tests, several breeding organizations 
have had in-house programmes for the develop¬ 
ment and use of genetic markers for genetic 
improvement within their breeding programmes 
(Clutter, 2004; Clutter et a/., 2004; McLaren, 
2007). 

The main use of genetic markers in com¬ 
mercial pig breeding has been to enhance 
genetic improvement within an established 
breed or line by using markers to reduce the 
incidence of deleterious effects (e.g. PSS) or 
to increase the frequency of alleles with 
favourable effects. Methods for incorporating 
genetic markers into breeding decisions are 
not clear, but probably vary from the preselec¬ 
tion of candidates based on genetic markers 
to incorporating markers into routine genetic 
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Table 16.6. Examples of genetic markers and genes used in swine. 


Gene (action/target) 

Traits affected 

Reference(s) 

RYR1 (halothane sensitivity) 

Lean growth, porcine stress 
syndrome, meat quality 

Fuji et al., 1991 

RN (Rendement Napole) 

Meat quality 

Milan et al., 2000 

ESR (estrogen receptor) 

Litter size 

Rothschild et al., 1996; Chen 
et al., 2001 

PRLR (prolactin receptor) 

Nursing performance and litter size 

Vincent et al., 1997; Farmer, 2000 

RBP4 (retinol binding protein 4) 

Litter size 

Messer et al., 1996 

MC4R (melanocortin-4 receptor) 

Lean growth, fatness, feed intake 

Kim et al., 2000 

IGF2 (insulin-like growth factor 2) 

Lean growth, litter size 

Van Laere etal., 2003; Jungerius 
etal., 2004; Liu etal., 2006 

HFABP/AFABP 

Intramuscular fat 

Gerbens et al., 1999 

c-KIT receptor 

Coat and skin colour 

Andersson-Eklund etal., 1996 

MC1R (melanocortin-1 receptor) 

Red/black coat colour 

Kijas et al., 1998 

PRKAG3 (protein kinase 

Meat quality 

Ciobanu et al., 2001 

AMP-activated gamma 

3-regulatory subunit) 

HMGA1 (high mobility group AT 

Backfat thickness 

Kim et al., 2004 

hook 1) 

CCKAR (cholecystokinin type A 

Feed intake and growth 

Houston et al., 2006 

receptor) 

CAST (calpain inhibitor) 

Tenderness 

Ciobanu et al., 2004 

EPOR (erythropoietin) 

Litter size 

Vallet et al., 2005 

F18 

Escherichia coli diarrhoea 

Vogeli et al., 1997 

Meijerink et al., 1996, 2000 

K88 

E. coli diarrhoea 

Jorgensen et al., 2003 


evaluation procedures to enhance the accu¬ 
racy of EBV (Rothschild, 2008). Other uses of 
genetic markers to enhance genetic improve¬ 
ment include: 

• Introgression of favourable genes from 

one breed into another breed; an exam¬ 
ple of marker-assisted introgression is 
the introgression of the halothane- 

resistance allele into the Pietrain breed in 
Belgium (Hanset et a/., 1995). By typing 
the closely linked GPI (glucose phos¬ 
phate isomerase) locus, Hanset et al. 
(1995) were able to fix the Large White 
normal allele and to obtain a halothane- 
negative Pietrain strain after three 

backcrosses. 

• Synthetic line development. 

• Parental identification and verification, 

which can be used to create, and improves 
the accuracy of, pedigree information 

that is used for genetic evaluation in cases 
where the pedigree of phenotyped indi¬ 
viduals is unknown or uncertain. 


While most applications focus on additive 
genetic improvement, the availability of markers 
or genes with large non-additive effects also 
opens opportunities for the strategic use of spe¬ 
cific genotypes in crossbreeding programmes. 
A prime example is the use of the paternally 
expressed IGF2 gene, which has been shown to 
have favourable effects on lean growth and 
reproductive performance, but only for the allele 
that is inherited from the sire (Van Laere et al., 
2003; Buys et a/., 2006, 2009; Liu et al., 
2006; Mathur et al., 2007) owing to an imprint¬ 
ing effect. By producing sows from a cross 
between a boar that is homozygous for the wild 
(G) allele for IGF2, and mating this sow to a ter¬ 
minal sire that is homozygous for the mutant (A) 
allele for leanness, all market pigs will be lean 
because their sire allele is the lean allele, in this 
way also improving uniformity in their carcasses. 
However, by having inherited the wild allele (G) 
from their sire, the sows will have increased 
reproductive performance but will not pass the 
increased fatness trait on to their progeny (Buys 
et al., 2006, 2009; Mathur et al., 2007). 
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In addition to the aforementioned uses of 
genetic markers to enhance genetic improve¬ 
ment, genetic markers also have several other 
uses that are relevant to commercial pig breed¬ 
ing programmes. Examples are: 

• line verification and identification; 

• measuring genetic distances between breeds 
(Foulley et a/., 2006; SanCristobal et a/., 
2006); and 

• traceability, using DNA barcodes to trace 
pork back to the farm of origin or to the 
breeding stock (Loftus, 2005; Webb, 2005). 


Genomic selection 

Recent developments in technology have 
removed some of the limitations of previous 
applications of QTL-mapping results for MAS, 
which, as already mentioned, have limited the 
use of markers in commercial breeding. These 
developments include genome sequencing, the 
identification of large numbers of genetic mark¬ 
ers across the genome in the form of SNPs, 
and the cost-effective high-throughput geno- 
typing of tens of thousands of such SNPs on 
individual animals. Combined with the further 
development of statistical methods for analysis 
of molecular data, this has led to a paradigm 
shift in the strategy of using genetic markers 


for the prediction of breeding values in the 
form of what has been termed ‘genomic selec¬ 
tion’ (Meuwissen et a/., 2001). Genomic selec¬ 
tion (GS) is an enhanced version of MAS that 
involves the selection of animals for breeding 
on the basis of their genotype for tens of thou¬ 
sands of ‘random’ SNPs that cover the genome. 
In GS, the association of each SNP with phe¬ 
notype is estimated using sophisticated statisti¬ 
cal and quantitative genetics models without 
pre-screening markers based on significance. 
This is in contrast to ‘traditional’ MAS, which 
involves a two-step approach, with screening 
of markers based on significance of their asso¬ 
ciation with phenotype as the first step, fol¬ 
lowed by the use of just those markers for 
selection (Lande and Thompson, 1990). 

The main principle of GS is outlined in 
Fig. 16.3, which is based on methods outlined 
by Meuwissen et a/. (2001). The first step is to 
collect phenotypes and DNA from a large 
group of individuals that have been pheno- 
typed for the trait, or that have progeny with 
those phenotypes, and to genotype each ani¬ 
mal using the SNPChip. The resulting ‘training 
data’ are used to ‘train’ a statistical model that 
estimates the effect of each of the SNPs on the 
SNPChip with the trait phenotype. In princi¬ 
ple, the estimate for a given SNP is based on 
the comparison of the average phenotypes of 
individuals that have alternative genotypes at 
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Fig. 16.3. Steps in genomic selection. SNP, single nuclear polymorphism; BV, breeding variation. 
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that SNP, as described above, but with GS this 
is done simultaneously for all markers on the 
SNPChip. The resulting estimates can then be 
used to predict the ‘Genomic’ EBV (G-EBV) of 
new individuals based on their genotypes for 
the SNPChip. 

In general, the prediction model for GS is 
developed by fitting the following linear model 
to phenotypes of individuals that make up the 
training population (Meuwissen et a/., 2001): 

V, = H + 2, X„ b s + e, (16.8) 

where y, is the phenotype (or progeny mean 
phenotype) of individual i in the training data, 
y represents fixed effects, the summation Z j is 
over all genotyped SNPs, X j; . is the number (0, 
1 or 2) of copies of allele ‘1’ (versus ‘0’) that 
individual i carries at SNP j, b j is the allele sub¬ 
stitution effect for SNP j, and e x is a random 
residual. One statistical challenge to fitting this 
model is that the number of markers (>30,000 
is common) typically is much greater than the 
number of animals with phenotypic records 
that are available to estimate their effects (typi¬ 
cally less than 2000). The most common 
method to deal with this is to fit the effect of 
each SNP as a random effect using Bayesian 
methods (Meuwissen et ai, 2001; Gianola 
et al ., 2009). An important distinction exists 
between models that use prior distributions 
that assume genetic variance is equally distrib¬ 
uted across all genotyped SNPs, the so-called 
genomic or G-BLUP method of Meuwissen 
et a/. (2001), and methods that assume that a 
large proportion of SNPs have zero or very 
small effects, such as the Bayes-B method of 
Meuwissen et al. (2001), and other versions 
and implementations of these methods (Gianola 
et al ., 2009; Calus, 2010). 

Because the accuracy of the ‘genomic’ 
EBV (G-EBV) values that are derived using GS 
models are difficult to predict, a validation step 
is usually included (Fig. 16.3). This involves 
separating the data set into training and valida¬ 
tion data sets, developing the GS prediction 
model on the training data set, applying the 
prediction model to compute G-EBV for indi¬ 
viduals in the validation data set and correlating 
the resulting G-EBV with the phenotypes of 
these individuals. Typically, the training and 
validation data sets are separated by year of 
birth of individuals, with the older animals used 


for training and the younger animals for valida¬ 
tion (Habier et ai, 2007). 

In principle, GS represents an extension 
of MAS, as described above, but with some 
important differences. The first is that MAS is 
based on the premise of using a limited number 
of markers that have been shown to have large 
effects on the trait(s) of interest in prior QTL- 
detection or marker-association analyses; in 
contrast, GS uses all SNPs for analysis and pre¬ 
diction, without the screening of associations 
for significance. The second is that genomic 
selection fits all SNPs in the model simultan¬ 
eously, rather than one at a time or by genomic 
region, as is done in typical QTL-detection 
studies. Combined with fitting marker effects 
as random effects, which regresses estimates 
towards zero, depending on the amount of 
information available in the data to estimate 
the effect and the strength of prior informa¬ 
tion, marker-based EBV values obtained from 
GS methods have the potential to capture a 
much greater proportion of genetic variance 
than was possible with the limited number of 
markers included in previous methods for 
MAS. In addition, because estimates are based 
on LD between markers and QTLs that exist 
across the population, G-EBV can in principle 
be derived using estimates of SNP effects 
obtained from phenotypes on individuals that 
are not closely related to the selection candi¬ 
dates. Thus, with GS, in contrast to traditional 
selection, the phenotype that is collected on an 
individual can be used to estimate not only the 
breeding value of the animal itself or its rela¬ 
tives, but also that of unrelated individuals. 
Thus, the need to collect phenotypes on selec¬ 
tion candidates or their close relatives is 
removed, and phenotypes collected can be lev¬ 
eraged for breeding value estimation across dif¬ 
ferent families and generations, and even to 
different populations and breeds. The use of 
GS for training across breeds does require 
denser panels than those that may be needed 
within a breed (e.g. over 300,000 versus 
50,000 SNPs, depending on the distance 
between the breeds involved), because only 
markers that are very close to the QTLs are 
expected to have associations that are consist¬ 
ent across breeds. In addition, this approach 
would fail if the effects of QTLs are not consist¬ 
ent across breeds because of non-additive 
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genetic effects or genotype-by-environment 
interactions. 

Recent applications in real data, in partic¬ 
ular from dairy cattle (Hayes et a/., 2009; 
Loberg and Diirr, 2009), and further theoreti¬ 
cal work by Goddard (2009) and Meuwissen 
(2009) have shown that much larger training 
data sets are needed than indicated by the ini¬ 
tial simulation results (several thousands of 
genotyped and phenotyped individuals). In 
addition, although the theory of GS, as pro¬ 
posed by Meuwissen et al. (2001), is predi¬ 
cated on capturing associations between 
markers and QTLs because of historic LD 
associations between markers and QTLs, it is 
now becoming clear that other factors can 
make major contributions to GS predictions 
from training data with dense and complex 
pedigree and family structures. These include 
genetic relationships (Habier et al., 2007) and 
recent LD and within-family effects (Goddard, 
2008). Although these effects contribute to the 
accuracy of G-EBV for close relatives of indi¬ 
viduals used for training (e.g. their progeny), 
they limit the accuracy of G-EBV for subse¬ 
quent generations, for families that are not well 
represented in the training data and for other 
less related populations. The implication is that 
G-EBV will be most accurate for selection can¬ 
didates that are closely related to individuals 
that are in the training data, and that accura¬ 
cies may be much lower for individuals that are 
less related, in particular for individuals from 
other breeds. Thus, continuous phenotypic 
recording across families may be needed to 
maintain accuracies of G-EBV within a breed. 

Although the cost of high-density SNP 
genotyping has declined rapidly until now in 
2010, costs are expected to remain too high 
for implementation of high-density SNP geno¬ 
typing on a large scale in pig breeding pro¬ 
grammes because of the large number of 
selection candidates that must be evaluated. 
Two approaches have been proposed to 
develop smaller, less costly, low-density geno¬ 
typing panels, namely by: (i) identifying a sub¬ 
set of the high-density SNPs that have strong 
associations with the trait (e.g. Gonzalez-Recio 
et al., 2008; Weigel et al., 2009); or (ii) using 
sparse evenly spaced SNPs across the genome 
to impute high-density SNP genotypes that are 
not on the small panel, on a within-family 
basis. Habier et al. (2009) have shown that, in 


contrast to approach (i), the evenly spaced low- 
density approach results in panels that are not 
trait- or population-specific and are robust to 
the underlying genetic architecture of the trait. 

To fully capitalize on the benefits of 
genomic selection, pig breeding programmes 
may need to be changed substantially because 
genomic selection removes many of the limita¬ 
tions that exist in current phenotype-based 
programmes with regard to when and on which 
individuals phenotypes must be evaluated (see 
Fig. 16.4). Comprehensive strategies for selec¬ 
tive recording of phenotypes with genomic 
selection must be evaluated. Redesign of breed¬ 
ing programmes with genomic selection can 
also capitalize on the greater emphasis that 
genomic selection provides for selection within 
families. This can be utilized in a number of 
ways to: (i) reduce rates of inbreeding; (ii) 
increase response to selection for a given rate 
of inbreeding by increasing selection intensi¬ 
ties; or (iii) reduce the size of breeding popula¬ 
tions and, thereby, costs, while maintaining the 
same rates of response and inbreeding. 

Conclusions 

The efficiency of breeding programmes can be 
assessed by measuring genetic changes occur¬ 
ring over time. The situation in pigs is well 
documented for the most important traits, 
based on the central-station testing and on-farm 
testing records available. From their review on 
genetic trends estimated in large national pure¬ 
bred populations, Sellier and Rothschild (1991) 
conclude that appreciable genetic gains have 
been obtained for growth and body composi¬ 
tion traits, of the order of 0.5-1.5% of the 
mean annually over periods of time ranging 
from 5 to 10 years in most studies. Annual 
genetic gains of about 0.01-0.02 piglets born 
have been reported by Bidanel and Ducos 
(1994), over the period 1975-1991. Since 
1995, breeding organizations have grown 
more and more commercial, as well as more 
and more international, and official publication 
of genetic trends has unfortunately disappeared. 
Phenotypic trends are clearly positive though, 
as can be observed from national production 
data. A phenotypic trend of around 0.5 piglets 
per sow per year appears to be feasible, with 
the larger part most probably being genetic. 
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Estimate marker effects at commercial level 


Fig. 16.4. Selection of pure lines for commercial crossbred performance based on estimates of marker 
effects estimated on crossbreds. EBV, estimated breeding value. Based on Dekkers (2007b). 


It can be concluded that efficient breeding 
schemes can help the pork industry to contin¬ 
uously adapt to a changing world. Directions 
of these changes are adaptation to different 
environments, adaptation to a changing econ¬ 
omy (reduction in labour), adaptation to chang¬ 
ing society concerns (castration, tail docking, 
group housing), and the desire for stronger 
and more robust animals. These changes 
require appropriate data recording, preferably 
automated, and associated pedigrees, prefer¬ 
ably on a crossbred level. 

The most important driving factor for the 
pork industry is still cost price reduction: more 
piglets per sow per year with low feed costs 
and low labour input under a large range of 
environments. 

The paradigm shift, which is currently 
under way, relates to the possibilities of genom¬ 
ics. The balance between phenotyping and 
genotyping is a delicate one. The aforemen¬ 
tioned changes require that new phenotyping 


and appropriate phenotyping are prerequisites 
for: (i) QTL detection; and (ii) training for 
genomic selection. Costs involved in maintain¬ 
ing enough populations of adequate size, phe¬ 
notyping, genotyping and genetic evaluation 
infrastructure are high and require more and 
more cooperation or consolidation. 

Genetic variation in the pig, however, is 
very large, and populations can adapt quickly 
to changing demands of the farmers in terms 
of efficiency, of the consumer in terms of safety 
and of society in terms of adaptation to the 
environments offered. The new era of genom¬ 
ics offers tremendous opportunities for this 
adaptation. 
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Introduction: Creating the Building 
Blocks - Genomics, Transgenesis 

and Cloning 

Obtaining a complete draft of the pig genome 
sequence has been central to the develop¬ 
ment and broad acceptance of the pig as a 
biomedical model (Schook et a/., 2005a,b). 
The pig genome sequence has recently been 
completed (http://www.ensembl.org/Sus_ 
scrofa/Info/Index), and the key building 
blocks for full utilization of the pig as a bio¬ 
medical model are now in place: completed 
genome sequence, ability to produce trans¬ 
genic animals and the ability to replicate the 
model through somatic cell cloning (Schook 
et a/., 2005b). The emergence of genetic 
information and the development of the 
necessary tools to target manipulations, in 
combination with the ability to clone pigs, 
provide a new and highly relevant animal 
model. These building blocks have stimulated 

426 


the development of ‘genomic postulates’ 
(Table 17.1) for evaluating animal models 
and, relevant to this chapter, the significance 
of the pig. This chapter was developed to 
provide background on the need for relevant 
animal models and to address each of the 
aspects of the genomic postulates. Owing to 
the overwhelming physiological (Tumbleson 
and Schook, 1996) and genomic similarities 
between pigs and humans (Humphray et a/., 
2007), the pig provides a uniquely relevant 
animal model for human disease. In addition, 
a recent CRISP (Computer Retrieval of 
Information on Scientific Projects) search 
(1999-2003) indicated that the US National 
Institutes of Health (NIH, which has over 
20 institutes and centres) sponsored research 
that supported 2400 separate grants that 
utilized the pig. Thus, a broad foundation 
supporting the pig as a model in biomedical 
research already exists from which to build 
future programmes. There is also growing 

©CAB International 2011. The Genetics of the Pig, 
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Table 17.1. Genomic postulates, adapted from 
Koch’s postulates. 

1. Isolate and propagate causal gene from animal 

2. Characterize (manipulate) gene in vitro 

3. Reintroduce putative gene (create transgenic 
animal) to test causality 

4. Demonstration of causal relationship through 
induced phenotype 


interest within the biomedical community 
with respect to the utilization of pigs in bioen¬ 
gineering, imaging and behavioural studies. 

The Animal Model Concept 

The use of animals to study human physiology 
and anatomy can be traced back to the second 
century common era (CME) in which Galen, a 
Greek physician and philosopher, completed 
research studies on apes and pigs (Galen, 1586) 
(Fig. 17.1). Galen incorrectly assumed that all 
extracted information derived from his use of 
animals could be directly applied to humans. It 
was not, however, until the 16th century CME 
that his error was initially recognized (Nomura 


et a/., 1987), when Bernard proposed the use 
of chemical and physical induction of disease in 
animals, thus becoming the first advocate for 
creating ‘induced animal models’ for biomedi¬ 
cal research. At the turn of the 20th century 
came the development of infectious disease 
animal models and their use for evaluating anti¬ 
bacterial drugs, and the introduction of the 
‘germ theory of disease’ (Koch, 1884; Fanning, 
1908). The end of the 20th century and the 
beginning of the 21st century realized the abil¬ 
ity to utilize naturally occurring models resulting 
from spontaneous mutations - severe com¬ 
bined immunodeficiency (SCID) or nude mice - 
and from genetically modified animal genomes 
through transgenesis or site-directed homolo¬ 
gous recombination. Linkage with the ability to 
clone animals, through the utilization of either 
embryonic stem cells or somatic cell nuclear 
transfer, provided even further ability to use ani¬ 
mals that have phenotypic characteristics close 
to humans as relevant animal models for dis¬ 
secting human disease. Finally, the emergence 
of the whole genome sequencing of animals 
with many physiological similarities to the 
human, such as the pig, supports the ability to 
actually create a large animal model that is 



Fig. 17.1. Timeline of animal models. SCID, severe combined immunodeficiency. Sources: a Galen, 1586; 
b Koch, 1884; c Fanning, 1908; d Dunn, 1965; e Mahley et ai, 1975; f Pantelouris, 1968; 9 Hardy etai, 1981; 
h Brinster et ai., 1984; 'Waters et a!., 1998;'Cooper et a!., 2002; k Laske et a!., 2005; 'Adam et a!., 2007. 
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genetically and phenotypically similar to humans 
in terms of disease attributes. 

Animal models represent important tools 
for investigating the pathogenesis of human dis¬ 
ease and developing appropriate treatment 
strategies. The coupling of genomic information 
(genome sequence, gene expression profiling 
and proteomics) with enabling technologies 
(transgenesis and cloning) has revolutionized the 
development of human biomedical animal mod¬ 
els. Traditionally, the mouse has been a powerful 
experimental system for understanding the com¬ 
plexity of cancer, diabetes and cardiovascular 
disease, among others. The dog is also consid¬ 
ered a comparable model to human disease 
because of its similarities to human anatomy 
and physiology, particularly with respect to the 
cardiovascular, urogenital, nervous and muscu¬ 
loskeletal systems. As such, it has long been 
used as a model in drug discovery and develop¬ 
ment research. Human disease may best be 
recapitulated in a large mammal such as the pig. 
The pig is often the primary biomedical model 
for a number of diseases, for surgical research 
and for organ transplantation owing to the simi¬ 
larity in size, anatomy and physiology between 
pigs and humans (Swanson etal., 2004). Animal 
models, regardless of species, can be grouped 
into one of the following five categories: 


(i) spontaneous models; (ii) genetically modified 
models; (iii) induced or experimental models; 
(iv) negative models; and (v) orphan models 
(Table 17.2). 

One approach to studying human disease 
is to characterize a naturally occurring disease in 
an animal that corresponds to a human disease. 
The best-known spontaneous model is the athy- 
mic nude mouse, the use of which represented 
a turning point in the study of heterotrans- 
planted tumours and enabled the first descrip¬ 
tion of natural killer cells (Pantelouris, 1968). 
Genetically engineered models were created 
that harboured genetic changes commonly 
found in human disease. The first transgenic 
mouse tumour model was established by over¬ 
expression of viral and cellular oncogenes in 
specific tissues (Brinster et a/., 1984; Stewart 
et a/., 1984; Adams et a/., 1985; Hanahan, 
1989). Induced models involve healthy animals 
in which the condition to be studied is experi¬ 
mentally induced through surgical modifications, 
genetic modifications or chemical application - 
demonstrated in 1918 when Yamagiwa and 
Ichikawa showed that coal tar experimentally 
applied to rabbit ears caused skin carcinomas 
(Yamagiwa and Ichikawa, 1918). More recently, 
considerable insight has been gained into 
the strengths and weaknesses of toxicity and 


Table 17.2. Advantages and disadvantages of animal model types. 


Model type 

Advantages 

Disadvantages 

Examples 

Spontaneous 

Similar disease 
phenotype to 
humans 

Long latency 

Not genetically 
defined 

Nude/severe combined immunodeficiency 
(SCID) mice (Pantelouris, 1968) 

Canine haemophilia (Giles etal., 1982); 
canine prostate cancer (Waters etal., 1998) 

Genetically 

modified 

Defined genetic 
background 

Phenotypic 
expression of 
genes can differ 
Transgenesis and 
homologous 
recombination 

Porcine tumour model (Adam etal., 2007) 

Mouse tumour model (Brinster etal., 

1984) 

Induced or 

Gene expression 

Not predictive of 

Atherosclerosis (Mahley etal., 1975; 

experimental 

controlled through 
diet or inducers 

Rapid disease onset 
Free choice of species 

therapeutic 

success 

Bell and Gerrity, 1992; Dixon etal., 

1999) 

Obesity (Spurlock and Gabler, 2008) 
Diabetes (Mordes and Rossini, 1981; Larsen 
etal., 2002; Larsen and Rolin, 2004) 

Orphan 

Useful for evaluation of 
chemical/radiological 
treatments 

Do not faithfully 
mimic human 

disease 

Feline leukaemia (Hardy etal., 1981); 
bovine leucosis (Gillet etal., 2007) 
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carcinogenicity studies in laboratory rats and 
mice. Infectious disease models are often 
restricted to a limited number of susceptible spe¬ 
cies, and the remaining unresponsive species 
are considered negative models because they do 
not develop the disease when exposed to a par¬ 
ticular stimulus (Hau, 2008). The main applica¬ 
tion of negative models is to gain insight into the 
physiological basis of disease resistance. There 
are functional disorders present in non-human 
species that have not yet been described in 
humans. Often, a similar disease will be identi¬ 
fied in a human that was previously described in 
animals. These animals represent ‘orphan mod¬ 
els’ for that particular disease as no human 
equivalent has been identified. Feline leukaemia 
(FeLV) represents a naturally occurring disease 
in domestic cats that is not transmissible to 
humans; like lymphoma in humans, lymphoma 
induced by FeLV in cats is characterized by 
immunosuppression. 

The incidence of chronic disease due to 
complex genetic and environmental interac¬ 
tions, however, has continued to increase dur¬ 
ing the past century. Understanding human 
disease is difficult owing to the complexity of 
genetics and lifestyle interactions, and the high 
cost associated with developing therapeutics. 
As such, appropriate biomedical models are 
essential because most medical knowledge, 
treatment regimes and medical device develop¬ 
ments are based on robust animal models. As 
genomic and bioinformatic technologies con¬ 
tinue to advance, our knowledge of animal 
models will increase, thereby refining our 
choice of models and enabling the develop¬ 
ment of more applicable models. Animal mod¬ 
els are essential tools for studying gene-gene 
interactions and gene-environment effects, 
and for preclinical testing of therapeutic inter¬ 
ventions. Given that mice, the most common 
animal model, frequently do not faithfully 
recapitulate human disease, pigs will continue 
to serve as important biomedical models. 


Utilizing the Pig to Improve 
Human Health 

During its multiple domestication events, the pig 
has undergone intense selection pressures for 


various phenotypes throughout the world (Chen 
et a/., 2007). First domesticated in Asia from 
the wild boar, germplasm was quickly moved 
around the world by explorers and used for food 
and products. Intense selection and breeding 
have provided distinct phenotypes differing in 
metabolism, fecundity, disease resistance and 
meat products (Schook et a/., 2005b; Schook, 
2007). Such selective pressures have resulted in 
differentiated subpopulations and phenotypes 
extremely relevant to current and future human 
health research. The selection of ‘mini’ and 
‘micro’ pigs for size, independently by investiga¬ 
tors throughout the world, attests to the global 
relevance of this experimental animal in bio¬ 
medical research. The porcine model is also rel¬ 
evant to human health research priorities such 
as obesity, female health, cardiovascular dis¬ 
ease, nutritional studies (as the pig is an omni¬ 
vore), and communicable diseases (reviewed in 
Tumbleson and Schook, 1996). The pig pro¬ 
vides a valuable biological model in these prior¬ 
ity areas because of the vast amount of research 
that has been conducted on the genetic and 
environmental interactions associated with com¬ 
plex, polygenic physiological traits. 


Informing Human Physiology: 
Similarities between Pig and Human 

Phenotypes 

Animal physiology has significantly contrib¬ 
uted to the basic understanding of human 
development and physiology related to disease 
(Table 17.3). For example, classical endo¬ 
crinology studies in pigs have led to the cur¬ 
rent understanding of several reproductive and 
pituitary hormones, most notably the compo¬ 
sition of insulin, which was first determined for 
porcine insulin and was used for several dec¬ 
ades to treat human diabetes (Rohrer et a/., 
2003). The porcine biomedical model has 
provided a fundamental research platform for 
developing human reproductive techniques 
and for studying reproductive diseases. 
Ongoing research using the pig to study can¬ 
cer and diabetes is contributing greatly to our 
understanding of these diseases and is further 
expanded upon in this chapter (Table 17.3). 
The pig has many similarities in structure and 
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Table 17.3. Validated swine biomedical models. 


Type of investigation Model 


Reference(s) 


Heart physiology Stent design, tissue engineering 

of blood vessels 
Atherosclerosis 
Myocardial infarction 
Ex vivo heart model 
Emergency procedures 

Reproductive Maternal-fetal interactions 

function 

Embryo development 
Sperm 

Transplantation Cell and organ transplants 

Xenotransplantation 

Skin physiology Percutaneous permeation 

Contact dermatitis 
Skin culture model 
Melanoma 

Brain Stroke 

AIDS, dementia 
Drug-binding sites and 
interactions 

Gut physiology and Gut structure and intestinal 
nutrition metabolism 

Obesity 

Probiotics and gut physiology 
Food allergies 

Biochemical Response to injury 

Imaging techniques 
Osteoporosis, bone density 
analysis 

Tissue engineering Cartilage repair 

Spinal fusion 

Organ-specific gene delivery 
Cataract repair 
Polymer scaffolds 
Tooth development 

Respiratory Neonatal respiratory distress 

function Asthma 

Infectious disease Therapeutics (vaccines, 

biotherapeutics, drug therapies) 
Developmental interactions 
Mucosal tissue responses 

Host response 


Bedoya et al., 2006; Gyongyosi et al., 2006 

Turk and Laughlin, 2004; Turk et al., 2005 
Ambrose, 2006; Boluyt et al., 2007 
Laske et al., 2005 

Casas etal., 2005; Geddes et al., 2006 
Green et al., 2006 

Sun and Nagai, 2003; Rohrer et al., 2006 
Strzezek et al., 2005; Lavitrano et al., 2006 
Larsen and Rolin, 2004; Street et al., 2004 
Cooper et al., 2002; Ibrahim et al., 2006 
Simon and Maibach, 2000; Dalton et al., 2006 
Stuetz et al., 2006 
Huang et al., 2006 

Geffrotin et al., 2004; Zhi-Qiang et al., 2007 
Imai et al., 2006 
Tambuyzer and Nouwen, 2005 
Minuzzi et al., 2005 

Eubanks et al., 2006; Qiu et al., 2006 

Brambilla and Cantafora, 2004 

Reid etal., 2003; Domeneghini etal., 2006 

Bailey et al., 2005; McClain and Bannon, 

2006 

Schmitt and Snedeker, 2006 

Ellner et al., 2004; Goldberg et al., 2004 

Teo et al., 2006 

Chang et al., 2006 

Drespe etal., 2005 

Kawashita et al., 2005 

Lassota et al., 2006; van Kooten et al., 2006 

Brown etal., 2006; Moroni et al., 2006 

Hu et al., 2005 

Miller et al., 2006 

Turner etal., 2002; Watremez et al., 2003 
Gonzalez et al., 2004; Cheetham et al., 2006 

Hasslung et al., 2005; Butler et al., 2006 
Dawson et al., 2005; Elahi et al., 2005; 

Pomeranz et al., 2005; Dvorak et al., 2006 
Houdebine, 2005 


function to humans, including size, feeding omnivore, it provides an adaptable model to 
patterns, digestive physiology, dietary habits, evaluate chronic and acute exposures to xeno- 
kidney structure and function, pulmonary vas- biotics such as alcohol, tobacco, feed additives 
cular bed structure, propensity to obesity, res- and environmental pollutants (Schook, 2007). 
piratory rates and social behaviours (Tumbleson Pigs have been used as models to evaluate 
and Schook, 1996). Because the pig is an alcoholism, total parenteral nutrition, organ 
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transplantation, atherosclerosis, exercise, 
hypertension, melanoma, nephropathy, der¬ 
mal healing, shock and degenerative retinal 
diseases. 

A severe shortage of organs and tissues for 
transplantation has also stimulated increased 
consideration of pigs as a potential solution, 
particularly with the recent ability to genetically 
modify pigs to overcome acute rejection (Lai 
et a/., 2002). Targets for the genetic modifica¬ 
tion of pigs for xenotransplantation have thus 
far emphasized reducing the immunogenicity of 
porcine cells and tissues and preventing rejec¬ 
tion after transplantation of porcine tissue. 
Acute rejection is mediated through preformed 
antibodies against galactosyl-a-1,3-galactose 
epitopes expressed on the surface of pig cells. 
Transgenic pigs have been developed that 
express regulators of the complement cascade, 
including CD55, CD59 and CD46, which sup¬ 
press the attack on donor tissues (Bucher et a I., 
2005; Cox and Zhong, 2005; Houdebine, 
2005). Another approach has focused on elimi¬ 
nating the galactosyl-a-1,3-galactose antigen 
from the surface of donor cells. Researchers 
have generated pigs without the gene encoding 
a-l,3-galactosyltransferase (Zhong, 2007). 
This was accomplished by the serial knockout 
of the gene in cultured pig fibroblasts, followed 
by somatic cell nuclear transfer to generate 
pigs. The convergence of transgenic and clon¬ 
ing techniques has enabled multilayered genetic 
modifications to be made in a single animal. 

Breeding among multiple existing trans¬ 
genic lines and introducing new genes by 
somatic cell nuclear transfer can be used in 
combination to overcome the various stages of 
xenograft rejection associated with xenotrans¬ 
plantation (Matsunari and Nagashima, 2009). 
The necessary genetic modifications are 
dependent on the specific transplant proce¬ 
dure. For example, the removal of the aGal 
epitope to prevent antibody reactivity and the 
insertion of complement regulators would 
increase the success of vascularized grafts, 
while pancreatic islet grafts would require the 
insertion of complement regulators, anticoag¬ 
ulants to prevent an inflammatory reaction 
and an anti-apoptotic gene to counteract 
ischaemia and reperfusion injuries (d’Apice 
and Cowan, 2009). Using these approaches, 
polytransgenic and a-l,3GalT-KO pigs have 


been produced, but further research is needed 
to create an efficient model (Rood et a/., 
2005; Tseng et a/., 2005; Yamada et a/., 
2005; Cooper et a/., 2007). 

Phenotypic research utilizing unique pig 
breeds has identified genetically controlled dif¬ 
ferences in fat deposition (Rothschild and 
Ruvinsky, 1998; Malek et a/., 200la,b). Such 
information provides the basis for developing 
an experimental model for understanding obes¬ 
ity and for the development of nutritional inter¬ 
ventions from prenatal nutrition to aged 
cohorts. Porcine resource populations have 
been selected for phenotypic variation in bone 
density (osteoporosis), sex-expressed nutri¬ 
tional and reproductive characteristics, and 
growth and development (embryonic, prenatal 
and postnatal). Using comparative genomics, 
new models have been identified to study how 
metabolism is linked to obesity-induced diabe¬ 
tes (Milan et a/., 2000). The porcine model will 
also be invaluable to study host-pathogen 
interactions for food safety (e.g. Salmonella ), 
potential biological warfare agents (African 
swine fever; foot-and-mouth disease) and 
agents that affect food security and human 
health (e.g. porcine endogenous retroviruses 
and other zoonotic diseases). 

Linking Genotypes and Phenotypes 
Relevant to Human Health 

The discovery that mammalian genomes prob¬ 
ably contain only 20,000-30,000 genes 
suggests that alternative transcripts and post- 
translational modifications must play a greater 
role in phenotypic expression than previously 
appreciated. It is also expected that single gene 
products affect different traits or disease states 
depending on the temporal and spatial pres¬ 
ence of gene products. As an omnivore, the 
pig is prone to many of the same dietary health 
problems as humans. Depending on diet and 
genetics, pigs can suffer from hypertension, 
hypercholesterolaemia, dyslipidaemia, insulin 
resistance and atherosclerosis. The pig has 
mutations in similar genes affecting these met¬ 
abolic disorders (e.g. ApoB and LDLR for 
hypercholesterolemia) (Ajiello et a/., 1994; 
Hasler-Rapacz et a/., 1998). Piglets are the 
preferred model organism to develop human 
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infant formula as their nutritional needs are 
comparable to those of human infants. Because 
of their similar digestive tracts, pigs are also 
susceptible to comparable enteric food-borne 
pathogens (e.g. Salmonella , enterohaemor- 
rhagic Escherichia coli) and pig intestinal lin¬ 
ings are used for in vitro studies of interactions 
with the intestine and these pathogens. Pigs 
are also susceptible to gastric ulcers that appar¬ 
ently are induced by diet and stress (Engstrand 
et a/., 1990). Additional anatomical similarities 
with humans include renal morphology, eye 
structure, skin and tooth development. The pig 
is also one of few animals that will voluntarily 
eat to obesity, as well as being susceptible to 
alcoholism. 

There are two reasons for research to 
investigate obesity-related genes in the pig. 
First, as already mentioned, the pig is a more 
realistic model organism for human obesity 
owing to its physiological similarities to humans 
(Tumbleson and Schook, 1996). As the pig is a 
true omnivore, the molecular basis and diges¬ 
tive tract anatomy of the pig are much closer to 
those of humans than any laboratory animal 
species, as identified by significant DNA poly¬ 
morphisms of obesity-related genes in the pig 
genome that might provide useful targets for 
the genetic study of human obesity. The sec¬ 
ond reason is that the genetic components of 
human obesity can play important roles in pig 
performance traits such as fatness, growth rate 
and feed intake. 


Surrogate Systems for Human 
Experimentation 

The domesticated pig has provided numerous 
surrogate experimental models for biomedical 
research. There has been a long tradition of 
using abattoir tissues for the purification of 
enzymes and the elucidation of metabolic path¬ 
ways. These tissues have also served as initial 
biologicals, with bovine and porcine insulin 
providing pre-recombinant DNA therapeutics 
and purified enzymes used to determine crys¬ 
talline structure. Porcine gamete biology has 
played a critical role in our understanding of 
stem cells and in vitro fertilization (Wu et ah, 
2001; Yin et ah, 2002). Because of the wealth 


of biological information derived from the por¬ 
cine system, it has increasingly become import¬ 
ant for studying epigenetic effects, as well as 
unravelling genomic imprinting. The demon¬ 
stration that pigs can be cloned using in vitro 
cloning systems provides an invaluable tech¬ 
nology platform for developing relevant clones 
of genetic models for biomedical research 
(Betthauser et a/., 2000; see Chapter 11). In 
addition, a major obstacle for producing cloned 
genetically modified pigs has been overcome 
(Lai et a/., 2002). Investigators have created a 
nuclear transfer technology using clonal fetal 
fibroblasts as nuclear donors for the production 
of gene-specific knockouts. This technology 
platform has significant applications beyond 
xenotransplantation, and the availability of 
genomic sequences will facilitate the broader 
utility of the pig as a surrogate system for 
human experimentation. 

The phenotypic diversity of hundreds of 
porcine breeds distributed throughout the 
world provides a tremendous resource for 
‘comparative phenomics’, the application of 
comparative genomic principles to the discov¬ 
ery of new genes underlying diverse pheno¬ 
types. In only a few thousand years, selective 
breeding has produced pig breeds that thrive in 
diverse environments (e.g. high altitude versus 
tropical), convert energy to muscle mass effi¬ 
ciently and rapidly, and tolerate specific patho¬ 
gens. There can be little doubt that the 
understanding of what makes porcine breeds 
different with respect to reproductive effi¬ 
ciency, bone structure, growth rates, fat depo¬ 
sition, altitude or heat tolerance and resistance 
to specific pathogens will be important to 
understanding basic biological processes 
important to human health (see Chapter 18). 


Extrapolation from Animals 

to Humans 

The selection of an animal model depends on a 
number of factors relating to the hypothesis to 
be tested. Often a number of different models 
may advantageously be used to study a biologi¬ 
cal phenomenon associated with a human dis¬ 
ease. For diseases such as cancer, there is a 
wide range of well-described models available, 
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both induced and spontaneous, in a variety of 
species. The key factor in using animal models 
for studying disease is that the results can be 
extrapolated to humans. Animal models of 
human disease are deemed relevant only if they 
are useful in recapitulating disease pathogenesis 
and assisting in the development of approaches 
to intervention or therapy (Hau, 2008). Thus, 
to ensure full utilization, a model needs to reli¬ 
ably mimic the normal anatomy and physiology 
of human organs and tissues of interest, as well 
as accurately reflect the morphological and bio¬ 
chemical aspects of disease pathogenesis. 

The rationale behind extrapolating results 
from an animal model to humans is primarily 
based on the similarity between morphological 
structures and physiological processes. For ex¬ 
ample, an animal model of cancer should ideally 
undergo tumour development and progression 
in a similar fashion to humans. While many ani¬ 
mals are more or less similar to humans in 
regard to biological characteristics, there are 
prominent differences in body size between spe¬ 
cies, which affects their appropriateness as a 
model for certain experiments. The validity of 
extrapolation may be further complicated by the 
prevalence of disease in humans, with certain 
sectors of the population having a higher inci¬ 
dence of one type of disease over another owing 
to genetic and environmental influences. 

Traditionally, animal models were used to 
identify the genes responsible for a disease. 
Trends in the use of animal models are chang¬ 
ing as new technologies are enabling research¬ 
ers to use animal models to study the effects of 
changes in genetic pathways. Developments 
in the fields of genomics, proteomics, biotech¬ 
nology and bioinformatics are changing the 
nature of biomedical research. The Human 
Genome Project is providing genetic informa¬ 
tion, not only from humans, but also from ani¬ 
mals traditionally used as models. Increased 
insight into genetic pathways and gene- 
environment interactions that are involved in 
the aetiology of complex human genetic dis¬ 
ease is providing the knowledge required to 
select better animal models. This knowledge 
can be applied to produce specific transgenic 
animals or knockouts, which better mimic the 
physiological complexity of human disease 
than existing models. New, more precise mod¬ 
els for the development of therapeutics can be 


created. Animal models are essential tools for 
studying gene-gene interactions and gene- 
environment effects, and for preclinical testing 
of therapeutic interventions. 

An important theme in toxicology research 
is the search for and the assessment of animal 
models that are predictive for adverse effects of 
pharmaceuticals in humans. This process is 
based on the assumption that the current 
choice of animal models is truly predictive of a 
human response to a treatment. To validate 
this assumption, a large multinational pharma¬ 
ceutical company survey analysed data com¬ 
piled from 150 compounds to determine the 
concordance of the toxicity of pharmaceuticals 
observed in humans with that observed in 
experimental animal models (Olson et ai, 
2000). The concordance rate was found to be 
71% for comparable target organs in rodent 
and non-rodent species, with non-rodents 
alone being predictive for 63% (primarily the 
dog) of human toxicity and rodents alone for 
43% (primarily the rat). The highest incidence 
of overall concordance was seen in haemato- 
logical, gastrointestinal and cardiovascular 
human toxicities, and the least was seen in 
cutaneous human toxicity. The results of this 
survey support the value of in vivo toxicology 
studies to predict for human toxicity associated 
with pharmaceuticals, and indicate that data 
collected from experiments in animals can be 
extrapolated to humans. It can also be con¬ 
cluded that the type of animal model chosen 
must be carefully evaluated. Traditionally, toxi¬ 
cology studies utilize rat and dog models, with¬ 
out considering whether there is an alternative 
species that might be more appropriate for 
testing a specific compound. While no animal 
model can completely recapitulate the effects 
of every drug administered to humans, previ¬ 
ous research has shown that large animals are 
better preclinical models for drug toxicity than 
rodents (Olson et a/., 2000). 


Modelling Human Disease in the Pig 

The pig has been used as an important large 
animal model for human disease for decades. 
The animal has a long lifespan of 10-15 years 
(Hau and Van Hoosier, 2003), so disease 
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progression is more similar to that seen in 
humans. Furthermore, as already discussed, 
the pig shares anatomical and physiological 
characteristics with humans that make it a 
unique and viable model for biomedical research 
(Tumbleson and Schook, 1996). Because of 
the similarity in body mass of pigs to humans, 
the pig has become a model of choice for tissue 
engineering and imaging studies (Lunney, 
2007). Their large size also makes them ideal 
models for study in such medical fields as sur¬ 
gery, imaging, chemotherapy and radiation, 
which cannot be accurately tested in small ani¬ 
mal models. 

Their cardiovascular anatomy and physiol¬ 
ogy, in combination with the pig’s response to 
atherogenic diets, have made them a univer¬ 
sally standard model for the study of athero¬ 
sclerosis, myocardial infarction and general 
cardiovascular studies. Their gastrointestinal 
anatomy has some significant differences from 
that of humans; however, the physiology of 
their digestive processes has made them a valu¬ 
able model for digestive diseases. The urinary 
system of swine is similar to humans in many 
ways, especially in the anatomy and function 
of the kidneys (Swindle and Smith, 2000). 
Swine are also a standard model for skin and 
reconstructive surgical procedures, and have 
been developed as models of transdermal tox¬ 
icity. The anatomy and physiology of organs 
such as the liver, pancreas, kidney and heart 
have also made this species the primary spe¬ 
cies of interest as organ donors for xenograft 
procedures (Swindle and Smith, 2000). 

In addition, the ability to use pigs from the 
same litter, and cloned or transgenic pigs, facili¬ 
tates genetic mapping (Lunney, 2007) and min¬ 
imizes immunological differences between 
animals in transplant studies. The availability of 
numerous well-defined cell lines from a broad 
range of tissues will assist in studies of gene 
expression and drug susceptibility testing. 
Sequencing of the swine genome (Schook et ai, 
2005a) has provided increasingly advanced 
genetic and proteomic tools for pigs. Many of 
these studies employ genomic approaches, as in 
heart, transplantation and melanoma models. 
The pig genome has a high sequence homology 
to humans, 60%, compared with a 40% 
sequence homology of rodents to humans 
(Thomas et a/., 2003; Humphray et a/., 2007), 
and the pig chromosomal structure has a higher 


similarity to humans than those of the mouse, 
rat, dog, cat, horse or cattle (Meyers et a/., 
2005; Murphy et a/., 2005). Each model will be 
affected by the availability of the functional 
genomic tools and swine genome sequence and 
maps (Rothschild et ai, 2007; Tuggle et a/., 
2007). 


Creating a Porcine Cancer Model 

The pig is an attractive model to study cancer 
biology and to help close the gap between basic 
science and patient benefit. Compared with 
rodents, the pig metabolizes drugs and under¬ 
goes tumorigenesis in a manner analogous to 
humans. Like humans, the incidence of cancer 
in pigs is rare, with a prevalence of childhood 
cancer - Wilm’s tumours in young pigs (Anderson 
and Jarrett, 1968) - and a broader spectrum of 
cancers in adults (Brown and Johnson, 1970). 
Furthermore, the pig provides an ideal system 
for preclinical studies of imaging, as well as of 
hyperthermia, radiation or photodynamic ther¬ 
apy of tumours. It is almost impossible to do 
intensity-modulated radiation therapy on mice 
owing to the small tumour size and the energy of 
the clinical accelerator. High-resolution intensity 
treatment in other rodents is hindered by the 
same problems, and devices used for hyperther¬ 
mia treatment of tumours cannot be scaled down 
to be useful for studies in rodents. 

Parallels in cancer biology between pigs 
and humans extend to the molecular level, as 
demonstrated by the reduced number of genes 
required to convert human and pig cells to a 
tumorigenic state compared with mouse cells 
(Kendall et ai, 2005). Additionally, telomerase 
is suppressed in a number of tissues and reacti¬ 
vated during cancer in both humans and pigs 
(Pathak et a/., 2000; Stewart and Weinberg, 
2000), indicating that there are also similarities 
in the process of tumorigenesis between the 
species. The genomic sequence homology 
between pigs and humans is also very high 
(Swanson et a/., 2004), and the porcine preg¬ 
nane X receptor protein that regulates p450 
cytochrome CYP3A, which metabolizes almost 
half of prescription drugs in humans, is more 
similar to that of humans than, for example, 
mice (Xie and Evans, 2002; Pollock et a/., 
2007). 
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It has been demonstrated that the enforced 
expression of transgenes that mimic genetic 
changes occurring in many types of human 
cancers can drive normal primary porcine 
cells to a tumorigenic state. Specifically, co¬ 
expression of human TERT ( hTERT ), p53 DD 
(a dominant-negative truncation mutant of 
p53), cyclin Dl, CDK4 R24C (an activated ver¬ 
sion of a cyclin-dependent kinase 4 mutant), 
c-Myc T58A (a stabilized version of the oncogene 
c-Myc) and H-Ras G12y (a constitutively active 
form of Ras GTPase) have the ability to drive 
porcine fibroblasts to form tumours when 
explanted into immunocompromised pigs at 
different anatomical sites (Adam et a/., 2007). 
These same genetic changes drive human kid¬ 
ney cells, mammary epithelial cells and myo¬ 
blasts to a tumorigenic state (Kendall et a/., 
2005), indicating that tumorigenesis in pigs is 
similar to the process in humans. Genetically 
engineered porcine tumour cells provided the 
first method of inducing tumours in a large ani¬ 
mal, and hence it is possible to tailor-make 
tumours of a defined background using the pig. 
Although this model is limited because the ani¬ 
mals need to be immunosuppressed for tumours 
to grow (akin to xenograft mouse models), pigs 
nevertheless have a number of clear advantages 
that make them ideal for preclinical studies of 
human cancers. The resultant tumours in the 
pigs could be grown to very large sizes, ideal for 
a number of preclinical applications. This model 
can be exploited in different cell types to gener¬ 
ate many different types of tumours potentially 
anywhere in the body (Table 17.4). 

Emerging Cancer Models Utilizing 

the Pig Phenotype 

Basal cell carcinoma is the most prevalent human 
cancer, with over 750,000 cancers being diag¬ 
nosed yearly in the USA alone, yet animal mod¬ 
els remain limiting owing to molecular and skin 


type differences between humans and mice. 
While mouse skin and human skin share many 
similar features, there are also major differences, 
which may contribute to the differences in skin 
tumorigenesis with respect to tumour type and 
mechanism between the two species. In humans, 
the three main types of skin cancer are: basal cell 
carcinomas (BCC), squamous cell carcinomas 
(SCC) and cutaneous melanomas (CM), with 
BCC being the most common of the three, rep¬ 
resenting approximately 70% of all human skin 
cancers (de Gruijl eta/., 2001). In contrast, mice 
do not develop BCC; the predominant malig¬ 
nant tumour type in mice is SCC (Feto et a/., 
1975; Bogovski, 1994). In addition, oncogenic 
Ras has an essential role in mouse skin tumori¬ 
genesis while it appears to have only a minor 
role in human skin cancer (Ananthaswamy and 
Pierceall, 1990; Pierceall et a/., 1991a,b). Thus, 
mice are not always ideal in vivo models for the 
study of human skin cancer. 

Among experimental animals, porcine 
skin is most similar to human skin and has been 
used extensively as a model of human wound 
healing (Lunney, 2007). More specifically, the 
porcine integument is morphologically 
(Montagna and Yun, 1964; Meyer et a/., 1978; 
Monteiro-Riviere and Stromberg, 1985; 
Monteiro-Riviere, 1986), histochemically 
(Meyer etai, 1986; Rigal etai, 1991;Woolina 
et a/., 1991), biochemically and biophysically 
similar to human skin. As such, the pig has 
been utilized as a model for drug toxicity and 
percutaneous absorption studies. Pig skin 
resembles human skin in having a sparse hair 
coat, a relatively thick epidermis, and similar 
epidermal turnover kinetics, lipid composition, 
carbohydrate biochemistry, lipid biophysical 
properties and arrangement of dermal colla¬ 
gen and elastic fibres (Weinstein, 1966; 
Forbes, 1967; Montagna, 1967; Meyer et a/., 
1981,1982). Reported differences in pigs 
include a unique interfollicular muscle that 
spans the triad of the hair follicle (Stromberg 


Table 17.4. Porcine cell transformation. 


Embryonic layer Cell type transformed Experimental model Tumour type induced 

Endoderm Keratinocytes In vitro cell transformation N/A 

Ectoderm Fibroblasts; mammary, In vitro cell transformation Squamous cell carcinoma 

kidney and testes cells 
T cells 


Mesoderm 


Live virus injection 


T cell lymphoma 
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et a/., 1981), the presence of apocrine sweat 
glands only on the body surface (Montagna and 
Yun, 1964; Monteiro-Riviere and Stromberg, 
1985) and a thicker stratum corneum (Meyer 
et a/., 1978; Bronaugh et a/., 1982). With 
regard to biochemical similarities between pigs 
and humans, for example, conservation of the 
matrix metalloproteinase genes MMP1 and 
MMP9 is greater between humans and pigs 
(89% and 85%, respectively) than between 
humans and mice (80% and 78%, respectively), 
based on the HomoloGene NCBI (US National 
Center for Biotechnology Information) 
database). 

As discussed previously, it has been dem¬ 
onstrated that porcine fibroblasts can be trans¬ 
formed in vitro and explanted into the pig to 
form tumours. Fibroblasts, however, are the pri¬ 
marily transformed cell type in less than 1% of 
human malignancies (Khavari, 2006). BCC, the 
most common cancer in the USA, and SCC, 
the second most common cancer in the USA, 
arise from keratinocytes (Khavari, 2006). 
Isolated porcine keratinocytes, the target cell 
population, can be transformed following the 
same procedure. Specifically, the co-expression 
of hTERT , p53 DD , cyclin Dl, CDK4 R24C , 
c-Myc T58A and H-Ras G12V is sufficient to drive 
porcine keratinocytes to form tumours when 
injected subcutaneously into immunocompro¬ 
mised mice. Further research has demonstrated 
that expression of only cyclin Dl, CDK4 R24C , 
H-Ras G12V and c-Myc T58A was sufficient to trans¬ 
form both porcine fibroblasts and keratinocytes 
to a tumorigenic state, indicating that fewer 
genes are required for successful porcine cell 
transformation and subsequent tumour forma¬ 
tion (K.N. Kuzmuk, 2009, unpublished results). 

The establishment of tumours using the 
pig as a model is possible, provided the animals 
remain on immunosuppressive therapy. When 
treatment with immunosuppressive drugs is 
halted, tumours, regardless of size, regress 
owing to an overwhelming host immune reac¬ 
tion to the tumour cells. Research using retrovi¬ 
ruses as vectors is being conducted to determine 
whether this approach eliminates the need for 
immunosuppressed animals. It is theorized that 
the manipulation of cells in tissue culture during 
the transformation process makes the cells 
immunogenic. It has been demonstrated that 
the injection of a virus encoding mutated H-Ras 


directly into the mammary fat pads of wild-type 
rats is tumorigenic (McFarlin and Gould, 2003; 
McFarlin et a/., 2003). For that reason, the 
direct in vivo injection of retroviruses contain¬ 
ing the transgenes required for porcine cell 
transformation in vitro would be tumorigenic in 
immunocompetent pigs. To test this hypothe¬ 
sis, viruses expressing the transgenes used to 
transform both the porcine fibroblasts and 
keratinocytes (cyclin Dl, CDK4 R24C , H-Ras G12V 
and c-Myc T58A ) were injected directly into the 
pig. Direct retroviral injection produced a low 
frequency of lymphoma of T cell origin (K.N. 
Kuzmuk, 2009, unpublished results). 


Needs and Opportunities 
for Expanding the Use 
of Pig Biomedical Models 

Novel approaches to harvesting genomic infor¬ 
mation to target genetic manipulations coupled 
with cloning have been identified as targets for 
further development (Schook et ai, 2005b). 
Emerging technologies such as recombineer- 
ing and gene trapping combined with relevant, 
standardized cell lines of targeted modifications 
could be used for cloning specific pigs for a 
given human disease. The National Swine 
Resource and Research Center (NSRRC) at the 
University of Missouri (http://www.nsm:. 
missouri.edu) provides essential support for 
creating genetic pig models of human diseases. 
Specifically, NSRRC has established significant 
resources to assist researchers in creating 
transgenic pigs, as well as to support the distri¬ 
bution of created models to investigators, thus 
providing a mechanism for generating and dis¬ 
tributing the ‘gold standard’ model for specific 
diseases or phenotypes. 

Finally, the pig will continue to grow as 
the biomedical model of choice in bioengi¬ 
neering and experimental surgery, and in zoo¬ 
nosis research related to the emergence of 
new diseases such as swine influenza. With 
respect to bioengineering and experimental 
surgery, the growing popularity of the pig ver¬ 
sus the dog has continued to rise, and the pig 
is now the most common large laboratory ani¬ 
mal species. The number of pigs used in 2002 
in registered research facilities as reported to 
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the US Department of Agriculture (USDA) 
was over 68,400, whereas the number of 
dogs declined from 201,000 in 1984 to 
68,200 in 2002 (http://www.aphis.usda. 
gov/publications). Completion of the pig 
genome sequencing will only accelerate the 
popularity and value of swine in biomedical 
research. The pig is currently being developed 
as a model to understand the pathogenesis of 
and immunity to human viral pathogens such 
as rotavirus, calicivirus and coronavirus (CoV). 
Saif and co-workers (Costantini et a/., 2004) 
have clearly demonstrated the utility of the pig 
as a model to understand the mechanisms for 
‘super-spreaders’ and the atypical pneumonia 
and variable diarrhoea induced by the human 
CoV responsible for severe acute respiratory 
syndrome (SARS). The porcine model of 
SARS consists of utilizing the porcine respira¬ 
tory CoV (FRCV), a spike deletion mutant of 
the enteric CoV transmissible gastroenteritis 
virus (TGEV), which shows striking pathogen¬ 
etic similarities to the SARS CoV in its pri¬ 
mary replication in the lung. Further research 
is justified to compare known immunological 

Table 17.5. Evolution of animal models generated 


differences and similarities between mice, 
humans and pigs. Current work by Dawson 
et al. (2008) has revealed that pig immune 
responses are more similar to human 
responses than mouse responses for over 
80% of the variables compared, and that the 
mouse immune responses were more similar 
to human than pig responses in less than 10% 
of comparisons (Dawson et a/., 2008). 
Genomic tools will continue to push existing 
animal models to evolve and novel models to 
be developed (Table 17.5). 
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genomic tools. 


Characteristic features Traditional view Current view 


Future view 


Relevance to disease 


Practical considerations 


Unique features 


Ethical features 


Overall characteristics 


Anatomy, physiology, Disease characteristics 
pathology and and therapies or 

responses to devices tested 

therapeutics 

Dietary and housing Restricted to gene-rich 
requirements, species (worms, fruit fly, 

husbandry, genetic yeasts, rodents) 
uniformity and cost 


Selected based on specific 
disease and therapeutic 
responses 

Emerging genomic profiles of 
animals with similar disease 
phenotypes to humans 


Emergence of new 
technologies for gene 
manipulation; knock-in/ 
knockout; conditional 
gene activation 


Clear laws, 
regulations and 
policies 

Practical and 
economical but 
relevance to human 
phenotype may be 
questioned 


Pain and stress protocol 
issues 


Genetically similar but is 
phenotype similar? 


Recombineering multi-allelic 
substitutions; in vivo gene 
expression monitoring; 
enhanced phenotyping of 
disease progression; 
bioinformatics and 
predictive profiling 
Unknown issues in addition to 
use of new species for 
biomedical-regulated 
animal protocols 
Ideal owing to recapitulating 
human condition 
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Domestication and Early 
Development of Breeds 

Swine have been domesticated for at least 
9000 years (Larson et a/., 2007, Chapter 2). 
Domestication took place in at least two loca¬ 
tions (Asia and Europe) with some intermin¬ 
gling of the strains as humans started to migrate 
across the Europe-Asia landmass. Like all 
domesticated animals, modern pigs were devel¬ 
oped because the species (Sus scrofa) both 
received and provided a benefit in its relation¬ 
ship with humans (Homo sapiens). Unlike sev¬ 
eral other species, it was farmers who had 
established a location who assisted in this pro¬ 
cess rather than nomadic peoples. Nomadic 
peoples would be more likely to domesticate 
grazing animals which could be moved to loca¬ 
tions with good forage while established farm¬ 
ers would use domesticated animals in a variety 
of ways. 

It is likely that human attempts to ‘improve’ 
swine began soon after domestication. If an ani¬ 
mal has a utility, breeding those that appear to 
perform that utility in a superior way is natural. 
Pigs have progressed through being a consumer 


of the ‘leftovers’ of human society, to pigs that 
provided a high amount of lard, to today’s pig, 
which consumes a carefully constructed diet of 
harvested feeds in order to produce animal pro¬ 
tein as a source of human food as efficiently as 
possible. 

Because local areas have different envir¬ 
onments, different needs and different genetic 
stocks to initiate genetic improvement, it is not 
surprising that the pigs of the world are fairly 
diverse. Breeds are natural outgrowths of that 
diversity. Several aspects of domestication, as 
it applies to genetic diversity, are discussed in 
Chapter 2 of the previous edition of this book 
(Jones, 1998). It is generally agreed that pigs 
were domesticated to serve as a source of pro¬ 
tein for human consumption and because of 
the unique taste of pork. 

What Is a Breed? 

The term ‘breed’ is a difficult one to define 
precisely because it means different things to 
different people. A breed might be defined as 
a group of animals with similar physical 
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characteristics (such as colour, body type, 
etc.). However, there are breeds that contain 
wide variation in such characteristics, while 
the members of other, different, breeds 
may be quite similar. There is general agree¬ 
ment that the concept of a breed denotes 
common ancestry, yet some organizations 
that protect the purity of a breed choose, 
periodically, to open their herd books to ani¬ 
mals from exotic ancestry. Lush (1994), quot¬ 
ing from Lloyd-Jones (1915), makes the 
following observation: 

A breed is a group of domestic animals, 
termed such by common consent of the 
breeders, a term which arose among 
breeders of livestock, created one might 
say for their own use, and no one is war¬ 
ranted in assigning to this word a scien¬ 
tific definition and in calling the breeders 
wrong when they deviated from the for¬ 
mulated definition. It is their word and 
the breeders’ common usage is what we 
must accept as the correct definition. 

Wright (1977), in his description of breed 
formation, describes a breed as something that 
arises more rapidly than normal evolutionary 
processes would dictate, but more slowly than 
would be true in the laboratory. Breed develop¬ 
ment probably covers almost the entire range of 
rates in that spectrum. Some breeds arise almost 
entirely through natural forces (e.g. Chinese 
breeds of pigs, which developed in localized 
areas such as valleys where there were human 
communities). Other breeds and lines were 
developed by human managers in a highly 
directed fashion (e.g. breeds developed by the 
purposeful crossing of other breeds, research 
herds or company lines). 

Breeds are easy to recognize in many 
developed countries because organizations 
have arisen to protect the breed purity and to 
pursue its improvement. These ‘breed socie¬ 
ties’ originated in Great Britain during the 
early part of the 19th century (Willham, 
1987), and spread to other countries, most 
notably the USA. Some breed societies are 
large businesses with many thousands of pigs 
registered, while others number their annual 
registrations with three digits and are organ¬ 
ized by a single individual. The American 
Livestock Breeds Conservancy lists several 


pig breeds on their Threatened (<1000 
annual registrations) or Critical (<200 annual 
registrations) lists, including Gloucestershire 
Old Spot, Large Black and Red Wattle. 

Breed associations may periodically 
decide to open their herd books to allow reg¬ 
istration of individuals from another breed in 
order to bring some desired characteristics 
into the breed. In the USA, this has happened 
with the Poland China, Spots and Chester 
White breeds in recent years. Additionally, 
pigs that represent the breed but are from 
other countries are, at times, registered. 
This provides the benefit of bringing in new 
genetic material and reducing the effects of 
inbreeding. 


Identifying Breeds of Swine 

for Inclusion 

A difficulty associated with describing pig 
breeds is identifying which breeds to include 
in the discussion. Several hundred breeds 
have been identified (Mason, 1996), although 
many of these are a national derivative of a 
breed that is imported from its native country. 
It would be desirable to identify all of the 
‘important’ breeds around the world. This 
task is rendered nearly impossible because of 
the difficulty in defining ‘important’. Breeds 
with high census numbers are likely to be con¬ 
sidered important, but there may be breeds 
with low numbers which are important either 
historically or as a source of unique genetic 
material for some future use. This raises the 
issue of genetic conservation - or preserva¬ 
tion of genetic material. Breeds may be con¬ 
served for economic, scientific or cultural 
reasons (Committee on Managing Global 
Genetic Resources, 1993; Chapter 13). There 
are organizations that are devoted to the iden¬ 
tification and preservation of breeds that have 
declined in numbers (Rare Breeds Survival 
Trust, American Livestock Breeds 
Conservancy). These organizations identify 
breeds that are in danger of becoming extinct. 
There is value in preserving biodiversity 
through the preservation of breeds (Animal 
Genetic Resources Group, 2009). There is 
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also an effort to maintain a gene bank for 
European endangered breeds (Glodek, 2001; 
Labroue et a/., 2001a,c; Ollivier et a/., 
2001b), and the USDA National Animal 
Germplasm Program was established to effec¬ 
tively conserve and facilitate uses of animal 
genetic resources. 

Ideally, a description of breeds would 
include those breeds with well-understood origin 
and well-researched characteristics. However, 
many breeds are important, at least in some 
parts of the world, even though they fit neither 
of these characteristics. The origin of many 
breeds has been lost owing to inadequate his¬ 
torical records, or is irrelevant as a result of the 
large-scale introduction of individuals from out¬ 
side the breed. There is considerable research 
information for some breeds but little, or none, 
for others. 

We are left with only imperfect methods 
for identifying breeds to include in a publica¬ 
tion such as this. We have chosen the follow¬ 
ing approach. Breeds are listed if they were 
included in Briggs and Briggs (1980), or are 
described at the Breeds of Livestock web site 
maintained by Oklahoma State University 
(http://www.ansi.okstate.edu/breeds/ 
swine/). 

Previous Work on Breeds of Swine 

There have been many other summaries of 
the status and value of swine breeds in the 
world (Freedeen, 1957; Omtvedt, 1974; 
Sellier, 1976, 1988; Fahmy and Holtmann, 
1977a; Briggs and Briggs, 1980; Johnson, 
1981; Fahmy and Moride, 1983; Legault, 
1985; Sutherland et ai, 1985; Buchanan, 
1987; McLaren, 1990; Sellier and Rothschild, 
1991; Ollivier and Molenat, 1992; Jones, 
1998; Ollivier et a/., 2001a). All of these 
describe the considerable diversity that exists 
among existing breeds, and attempt to 
describe the value of breeds for commercial 
swine production. Additionally, there have 
been several published descriptions of the 
very interesting Chinese breeds of swine (Xue, 
1991; Mercer and Hoste, 1994; Mao, 1995). 
These Chinese breeds are of special interest 
because many of them are very highly prolific 


(Sellier and Legault, 1986; Bidanel et a/., 
1989a,b,c). 


Describing Breeds of Swine 

The breeds are described here in a series of 
tables and colour plates (Plates 4-27). In 
Tables 18.1-18.5 breeds are included that 
have been developed in various parts of the 
world. Where information is available, 
breeds are described for size, lean-to-fat 
ratio and prolificacy. The descriptors for 
some of the breeds have been derived from 
a review of breed comparison research in 
North America (Johnson, 1981) and from a 
French evaluation of the Meishan breed 
(Bidanel et a/., 1989a,b,c). Descriptors for 
the other breeds are highly subjective. They 
probably reflect performance that is depend¬ 
ent upon the environment in which the 
breeds are used, and may not indicate the 
performance levels that would be achieved if 
all of the breeds were managed in a uniform 
environment. 

Tables 18.6-18.9 are included to provide 
references for research information concern¬ 
ing the breeds. Numerous research papers, 
published in refereed journals, are included 
with a list of the breeds evaluated in the project 
described in each paper. The four tables divide 
the experiments into those that: (i) evaluate 
breeds of American or British origin; (ii) include 
breeds of Asian origin; (iii) include at least one 
European local breed; and (iv) include the 
Pietrain breed. 

There were intensive efforts at many 
state experiment stations in North America to 
compare breeds of swine during the 1960s, 
1970s and 1980s. This is easy to observe in 
Table 18.6. This was followed by increased 
interest in the highly prolific breeds from 
China, with much research done by scientists 
in North America, Europe and Asia (Table 
18.7). The Pietrain breed is interesting 
because of its very high ratio of lean to fat, 
and it has also been studied heavily (Table 
18.9). The allele for porcine stress syndrome 
(PSS) is also present in high frequency in the 
breed. 



Table 18.1. Breeds of pigs originating in North and South America. 


Breed 

Place of origin 

Breed origin 

Distribution 

Colour 

Ears 

Other 

Size and 
growth 3 

Lean-to-fat 

ratio 

Prolificacy 

Chester 

Pennsylvania, 

Yorkshire, 

North America 

White 

Droop 


Moderate 

Moderate 

High 

White 50 

USA 

Lincolnshire 








Duroc bd 

New Jersey and 

Jersey Red, 

North America, 

Red 

Droop 


Moderate 

Moderate 

Moderate 


New York, 

Berkshire, other red 

Europe 




to high 

to high 



USA 

pigs 








Hampshire 13 ' 6 

Kentucky, USA 

Old English 

North America, 

Black, white belt 

Erect 


Moderate 

Moderate 

Moderate 




Europe 




to high 

to high 


Hereford' 

Missouri, USA 

Duroc, Chester White, 

USA 

Red, white face, 

Erect 







Poland China 


belly, legs 






Lacombe b 

Alberta, Canada 

Landrace, Berkshire, 

Canada, USA, 

White 

Droop 

Composite breed 

Moderate 

Moderate 

High 



Chester White 

Mexico, 



developed by 







Europe 



Canadian Dept 










Agriculture 




Moura 

Brazil 

Duroc, Canastra, 

Brazil 

Blue roan 

Droop 

Rare 






Canastrao 








Mulefoot 

USA 


USA 

Black 

Droop 

Rare, single toed 




Ossabaw 

Ossabaw Island 

Descendants of 

Ossabaw Island 

Varied 

Erect 

Used for diabetes 

Low 

Low 


Island 

(off coast of 

Spanish pigs 




research, rare 





Georgia, USA) 

brought in 1500s 








Poland China 59 Ohio, USA 

Bedford, Berkshire, 

North America 

Black, six white 

Droop 


High 

Moderate 

Moderate 



others 


points 






Spotted 55 

Ohio, USA 

Poland China 

North America 

Black, white spots 

Droop 


Moderate 

Moderate 

Moderate 

Yorkshire 5 -^ 

Ohio, USA (from 

Large White 

North America, 

White 

Erect 


Moderate 

Moderate 

High 


England) 


Europe 







Pampa-Rocha k Uruguay 


South America 







Piau 1 

Brazil 


South America 








a Terms such as high, moderate and low are necessarily relative. For breeds that have been well studied, the terms reflect information derived from objective breed comparisons. For 
other breeds, the terms are more subjective. 
b Performance information inferred from Johnson (1981). 
c Plate 4; d Plate 5; e Plate 6; 'Plate 7; 9 Plate 8; h Plate 9; 'Plate 10. 

The Yorkshire breed originated directly from the British Large White. The intermingling of the breeds continues as pigs from the Large White breed are sometimes registered as 

Yorkshires. 

k Plate 11; 'Plate 12. 
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Table 18.2. Breeds of pigs originating in the UK. 


Breed 

Place 
of origin 

Breed origin 

Distribution 

Colour 

Ears 

Other 

Size and 
growth 3 

Lean-to-fat 

ratio 

Prolificacy 

Berkshire 15 

England 


North America, 

Black, six white points 

Erect 


Moderate 

Moderate 

Moderate 




Europe 







British Lop 

England 

White pigs of Wales, 

England 

White 

Droop 


Moderate 





Cumberland and Ulster 





to high 



Gloucestershire 

England 

Local breeds 

England 

White, black spots 

Droop 


Moderate 


High 

Old Spot 0 







to high 



Large Black 

England 

Devon, Cornwall and East 

England, USA, 

Black 

Droop 

Endangered 

Moderate 

Moderate 

Moderate 



Anglia 

South Africa, 










Australia 







Large White 

England 

Cumberland, Leicestershire, 

Europe, North 

White 

Erect 


Moderate 

Moderate 

High 



Middle and Small White 

America, Asia 







Middle White d 

England 


England 

White 

Erect 


Low 

Moderate 

Moderate 

Oxford Sandy 

England 

Berkshire, Tamworth, others 

England 

Sandy with black 

Erect 

Rare 

Moderate 

Moderate 

High 

and Black 




blotches 






Saddleback 

England 

Essex, Wessex 

England 

Black, white belt 

Erect 





Tamworth 6 

England 


England, North 

Red 

Erect 


Moderate 

Moderate 

Moderate 




America 





to high 



a Terms such as high, moderate and low are necessarily relative. For breeds that have been well studied, the terms reflect information derived from objective breed comparisons. For 
other breeds, the terms are more subjective. 
b Plate 13; c Plate 14; d Plate 15; e Plate 16. 
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Table 18.3. Breeds of pigs originating in Europe. 


Breed Place of origin Breed origin 


Distribution 


Angeln 

Germany 

Landrace, Wessex- 

Saddleback 5 


Saddleback 

Bazna 

Romania 

Berkshire, Mangalitsa 

Belarus Black 

Belarus 

Large White, Large Black, 

Pied 


Berkshire, Middle White, 

Bentheim 

Germany 

local breeds 

Berkshire, Cornwall, local 

Black Pied c 


breeds 

Black 

Croatia 

Berkshire, Poland China, 

Slavonian 


Black Mangalitsa 

Bulgarian 

Bulgaria 

Large White, Edelschwein, 

White 


native breeds 

Czech 

Czech Republic Large White, Edelschwein, 

Improved 


Landrace 

White 

Dermantsi 

Bulgaria 

Berkshire, Mangalitsa, local 

Pied 


breeds 

Iberian 

Spain 

Extremadura Red, Jabugo 

Krskopolje 

Slovenia 

Spotted, Black Iberian 
Suffolk, Carniola, Berkshire, 

Laconie 


Yorkshire 

Hampshire, Pietrain, Large 

Landrace d 

Denmark 

White 

Large White, native breeds 


Europe 

Europe 

Belarus 

Germany 

Croatia 

Bulgaria 

Czech 

Republic 

Bulgaria 

Spain 

Slovenia 

North America, 
Europe 





Size and 

Lean-to-fat 


Colour 

Ears 

Other 

growth 3 

ratio 

Prolificacy 

Black, white belt 

Droop 

Nearly extinct 

Moderate 

Low 


Black, white belt 
Usually black 

Droop 


Moderate 

Low 

Moderate 

White, black spots 

Droop 

Nearly extinct 

Moderate 

Low 

Moderate 






to high 

Black 

Droop 

Nearly extinct 




White 






White 






Black and white 






Varies 

Droop 





Black, white belt 

Droop 

Rare 

Low 

Low 

Low 




Moderate 

Moderate 

Moderate 

White 

Droop 

Many strains 

Moderate 

Moderate 

High 



in different 
countries 
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Lithuanian 

Lithuania 

Local breeds 

Lithuania 

Spotted, varied 

Erect 

Rare 

Moderate 

Low to 

Moderate 

Native 




colours 




moderate 

to high 

Mangalitsa 6 

Austria- 

Local breeds 

Europe 

Varied 

Erect 






Hungary 









Mora 

Ravenna 

Indigenous breed of region 

Italy 

Dark brown 

Droop 


Moderate 

Low 

Low 

Romagnola 

province, Italy 









Pietrain f 

Belgium 


Europe 

White, black spots 

Erect 


Moderate 

Very high 

Moderate 

Swabian-Hall 

Germany 


Germany 

White, black head 

Droop 


Moderate 

Moderate 

Moderate 

Swine 




and rump 






Turopolje 

Croatia 

Local breeds, Siska, 

Croatia 

White, black spots 

Droop 

Nearing 

Low 

Low 

Low 



Krskopoljski 




extinction 





a Terms such as high, moderate and low are necessarily relative. For breeds that have been well studied, the terms reflect information derived from objective breed comparisons. For 
other breeds, the terms are more subjective. 
b Plate 17; c Plate 18; d Plate 19; e Plate 20; 'Plate 21. 



Table 18.4. Breeds of pigs originating in Asia. 


Breed 

Place of origin 

Breed origin 

Distribution 

Ba Xuyen 

Vietnam 

Berkshire, Chinese 
breeds 

Asia 

Beijing Black 

China 

Berkshire, Large White, 
local breeds 

China 

Cantonese 

China 


China 

Da Min b 

China 


China 

Fengjing 0 

China 


Asia, exported 
to USA 

Hezuo 

China 


China 

Jinhua 

China 


China 

Kele 

China 


China 

Luchuan d 

China 


China 

Large 

Black-White 

China 


China 

Meishan e ’ f 

China 

Local breeds in Taihu 
Lake region 

China, North 
America 

Minzhu 

China 

Local breeds in 
northern China 

China, North 
America 

Mong Cai 9 

Vietnam 

Local breeds in 
northern Vietnam 

Vietnam 


Neijiang 


China 


China 





Size and 

Lean-to-fat 


Colour 

Ears 

Other 

growth 3 

ratio 

Prolificacy 

White, black spots 

Erect 


Low 

Low 

Low 

Usually black 






Black and white 






Black 

Droop 

Imported to 

Low 

Very low 

Very high 



USA in 1989 

Low 


Low 

White, black head and 

Droop 




High 

rump 

Droop 


Low 

Very low 

Low 

White, black spots 

Droop 


Low 

Low 

Very high 

Black, white points 

Droop 

Imported to 

Low 

Very low 

Very high 



USA in 1989 




Black 

Droop 

Imported to 

Low 

Very low 

Very high 



USA in 1989 




Black and white 

Erect 


Low 

Low 

High 


Droop 

Very early 

Low 

Very low 

Moderate 


sexual 

maturity 
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Ningxiang 

China 

Local breeds of Hunan 

China 

Black and white 

Droop 


Low 

Very low 

Moderate 



Province 








Thuoc Nhieu 

Vietnam 

Bo Xu, Yorkshire 

Vietnam 

White 

Erect 


Low to 

Low 

Moderate 








moderate 



Tibetan 

Tibet 


China 

Black 

Erect 

Adapted to cold, 

Low 

Low 

Low 







pasture 

environment 




Tongcheng h 

China 


China 







Vietnamese 

Vietnam 

Breed of Vietnam 

Europe, North 

Black and white 

Erect 

Imported in 

Very low 



Pot Belly 



America 



1986 as pets 





a Terms such as high, moderate and low are necessarily relative. For breeds that have been well studied, the terms reflect information derived from objective breed comparisons. For 
other breeds, the terms are more subjective. 
b Plate 22; c Plate 23; d Plate 24; e Plate 25. 

'Information for Meishan derived from Bidanel et al., 1989a,b,c. 
g Plate 26; h Plate 27. 
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Table 18.5. Breeds of pigs originating in Africa and Oceania. 


Breed 

Place of origin 

Breed origin 

Distribution 

Colour 

Ears 

Other 

Size and 
growth 3 

Lean-to-fat 

ratio 

Prolificacy 

Arapawa Island 
Bantu 

Guinea 

Kunekune 

New Zealand 

South Africa 

Guinea coast of Africa 

New Zealand 

From introduced pigs 

New Zealand 

New Zealand 

Varied 

Usually brown 
Black 

Varied 

Erect 

Feral 

Rare 

Tassels on lower jaw 

Low 

Low 

Low 

Low 

Low 

Mukota 

Rhodesia, Zimbabwe 

in early 1800s 

From pigs introduced 

Africa 

Black 


Adapted to harsh 

Low 

Low 

Low 

Red Wattle 

New Caledonia 

by European and 
Chinese traders 

North America 

Red 

Erect 

tropical environment 

Wattles on jaw 

Low 

High 



a Terms such as high, moderate and low are necessarily relative. For breeds that have been well studied, the terms reflect information derived from objective breed comparisons. For 
other breeds, the terms are more subjective. 
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Table 18.6. References for research information comparing American and British breeds. 

Reference(s) Location Breeds 


Dufourand Fahmy, 1975 


Agriculture Canada 


Hampshire, Lacombe, Landrace, Yorkshire 


Fahmy, 1972; Fahmy etal., 1975, 1976, 1978 


Agriculture Canada 


Jeremiah etal., 1999 
Luiting etal., 1995 

Kuhlers etal., 1980, 1981, 1982, 1985, 1989a,b; Jungst and 
Kuhlers, 1984 

Jones etal., 1980; Richmond etal., 1979 
Bereskin and Davey, 1976; Davey and Bereskin, 1977; 

Bereskin and Steele, 1986 
Park and Yi, 2002 

Cameron, 1990; Cameron etal., 1990 
Kennedy and Conlon, 1978 
Ruusunen and Puolanne, 1997 
Zheng etal., 2002 
Baas etal., 1992a,b 
Meeker etal., 1985, 1987 
Schneider etal., 1982a,b 
Zhang etal., 2007 


Agriculture Canada 
Agricultural University of Norway 
Alabama, USA 

Alberta, Canada 
BARC, a USA 

Daejeon, Korea 
Edinburgh, UK 
Guelph, Canada 
Helsinki, Finland 
Hunan, China 
Iowa, USA 
Iowa, USA 
Iowa, USA 
Iowa, USA 


Shu, 1996 Jiangxi, China 

Fahmy and Holtmann, 1977a,b; Langlois and Minvielle, 1989a,b Laval University, Canada 
Holtmann etal., 1975 Laval University, Canada 


Purchas etal., 1990; Smith etal., 1988, 1990 Massey University, New Zealand 

Suzuki and Watanabe, 1996; Suzuki etal., 2001 Miyagi, Japan 

Suzuki etal., 2003 Miyagi, Japan 

Lishman etal., 1975 Newcastle, UK 

Neely etal., 1980; Neely and Robison, 1983;Toelle and North Carolina, USA 

Robison, 1983 

Nelson and Robison, 1976 North Carolina, USA 

Neal and Irvin, 1994 Ohio, USA 


Berkshire, Duroc, Hampshire, Lacombe, Landrace, Large 
Black, Yorkshire 

Duroc, Hampshire, Landrace, Yorkshire 
Duroc, Landrace 

Duroc, Hampshire, Landrace, Spot, Yorkshire 

Duroc, Hampshire, Lacombe, Yorkshire 
Duroc, Yorkshire 

Duroc, Yorkshire 
Duroc, Landrace 
Duroc, Hampshire, Landrace 
Hampshire, Landrace, Yorkshire 
Duroc, Landrace, Large White 
Hampshire, Landrace 
Duroc, Landrace, Yorkshire 
Chester White, Duroc, Hampshire, Yorkshire 
Berkshire, Chester White, Duroc, Hampshire, Landrace, 
Poland China, Spot 
Duroc, Landrace, Large White 
Duroc, Hampshire, Landrace, Yorkshire 
Berkshire, Duroc, Hampshire, Lacombe, Landrace, Large 
Black, Tamworth 

Duroc, Hampshire, Landrace, Large White 
Duroc, Landrace, Large White 
Berkshire, Duroc 

Hampshire, Large White, Landrace 
Duroc, Yorkshire 

Duroc, Hampshire, Yorkshire 
Large White, Yorkshire 

Continued 
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Table 18.6. Continued. 


Reference(s) 

Location 

Breeds 

Shurson and Irvin, 1992 

Ohio, USA 

Duroc, Landrace 

Yen eta!., 1987 

Ohio, USA 

Chester White, Duroc, Hampshire, Landrace, Spot, 
Yorkshire 

Hutchens etal., 1982; Fent etal., 1983; Buchanan and 

Oklahoma, USA 

Duroc, Landrace, Spot, Yorkshire 

Johnson, 1984; Gaugler etal., 1984; 

McLaren etal., 1987a,b,c,d 

Johnson and Omtvedt, 1973, 1975; Johnson 

Oklahoma, USA 

Duroc, Hampshire, Yorkshire 

etal., 1973, 1978;Young etal. 1976a,b; 

Wilson etal., 1977; Wilson and Johnson, 1981a,b 

Goenaga and Carden, 1979 

Pergamino, Argentina 

Duroc, Hampshire, Landrace 

Kim etal., 1983 

Seoul National University, Korea 

Duroc, Hampshire, Landrace, Spot, Yorkshire 

Kim etal., 2007 

Seoul National University, Korea 

Berkshire, Duroc, Landrace, Yorkshire 

Tan and Chen, 1983 

Singapore 

Duroc, Hampshire, Landrace, Poland China, Yorkshire 

Lee etal., 1986 

Suweon, Korea 

Lacombe, Large White 

Edwards etal., 1992 

Terringtong, UK 

Duroc, Large White 

Chang etal., 1998 

TLRI, b Taiwan 

Berkshire, Duroc, Landrace, Yorkshire 

Martinez Gamba et al., 2006 

Universidad Nacional Autonoma de 
Mexico 

Duroc, Landrace, Yorkshire 

Blasco et al., 1994 

Universidad Politecnica de Valencia, 
Spain 

Duroc, Landrace, Large White 


a Henry A. Wallace Beltsville Agricultural Research Center, Maryland (US Department of Agriculture Agricultural Research Service). 
b Taiwan Livestock Research Institute. 
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Table 18.7. References for research information that includes a Chinese breed. 


Reference(s) 


Location 


Breeds 


Xu etal., 1998 
Kouba etal., 1997 
Bazer etal., 1988 
Xu etal., 2009 

Gerfen etal., 1994 
Wolter etal., 2000 
Zou etal., 1992 

Bidanel, 1990; Bidanel etal., 1990a,b, 1991 
Canario etal., 2006, 2009; Laloe etal., 2006 
Geverink etal., 2006; Foury etal., 2007 

Legault and Caritez, 1982; Legault etal., 1982, 1984; Rombauts etal., 1982 
Legault etal., 1985 

Bidanel, 1989, 1993; Bidanel etal., 1989a,b,c, 1993; Despres etal., 1992; 
Mandonnet etal., 1992;Terqui etal., 1992; Camara etal., 1994; Downey 
and Driancourt, 1994; Reviers etal., 1997; Driancourt etal., 1998; Mourot 
and Kouba, 1999; Lefaucheur and Ecolan, 2005 
Touraille etal., 1989 
Vrillon and Caritez, 2000 
Halbur etal., 1998 

Youngs etal., 1993; Biensen etal., 1998; Vonnahme etal., 2002 
Gispert etal., 2007 
Franck etal., 1998 

Biggs etal., 1993; Flunter etal., 1993; Hunter, 1994 
Lee etal., 1994 

Wilmut etal., 1992; Ashworth etal., 1994 


Alberta, Canada 
ENSAR, a France 
Florida 

Huazhong Agricultural 
University, China 
Illinois, USA 
Illinois, USA 
Illinois, USA 
INRA, b France 
INRA, France 
INRA, France 
INRA, France 
INRA, France 
INRA, France 


INRA, France 
INRA, France 
Iowa, USA 
Iowa, USA 
IRTA, C Spain 
Lyon, France 
Nottingham, UK 
Roslin, UK 
Rowett Institute, UK 


Large White, Meishan 
Large White, Meishan 
Large White, Meishan 
Large White, Meishan 

Fengjing, Meishan, Yorkshire 
Duroc, Landrace, Meishan, Yorkshire 
Hampshire, Landrace, Meishan, Yorkshire 
Jiaxing, Jinhua, Meishan, European breeds 
Duroc, Laconie, Large White, Meishan 
Duroc, Landrace, Large White, Meishan, Pietrain 
Mizxing, Kinhwa, Meishan 
Jiaxing, Jinhua, Large White, Landrace, Meishan 
Large White, Meishan 


Jia Xiu, Large White, Meishan, Pietrain 
Meishan, Jiaxing, Jinhua, European breeds 
Duroc, Hampshire, Meishan 
Meishan, Yorkshire 

Duroc, Large White, Landrace, Meishan, Pietrain 
Duroc, Large White, Meishan, Pietrain 
Landrace, Large White, Meishan 
Large White, Meishan 
Landrace, Large White, Meishan 

Continued 
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Table 18.7. Continued. 


Reference(s) 

Location 

Breeds 

Kim etal., 2001 

Seoul National University, 
Korea 

Landrace, Large White, Meishan 

Yen etal., 2001 

Taichung, Taiwan 

Duroc, Taoyuan 

Tzeng, 1991 

Taiwan Livestock Research 
Institute 

Landrace, Taoyuan 

Young, 1994 

USMARC, d USA 

Duroc, Fengjing, Meishan, Minzhu 


a Ecole Nationale Superieure Agronomique de Rennes. 
b lnstitut National de la Recherche Agronomique. 
c lnstitut de Recerca i Tecnologia Agroalimentaries. 
d US Meat Animal Research Center. 
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Table 18.8. References for research information that includes a local breed. 


Reference(s) Location 


Breeds 


Franci et al., 2001 Dipartimento di Scienze Zootecniche, Italy Italian local breeds, Large White 

Glodek et at., 2001 Gottingen, Germany German local breeds, Pietrain 

Labroue etal., 2000, 2001b ITP, a France French local breeds, Large White 

Gourdine et al., 2006 INRA, b France Creole, Large White 

Barba et al., 2001 Universidad de Cordoba, Cordoba, Spain Spanish and Iberian local breeds 


a lnstitut Technique du Pore. 

b lnstitut National de la Recherche Agronomique. 


Table 18.9. References for research information that includes Pietrain. 


Reference(s) 

Location 

Breeds 

Castaing et al., 2002 

ADAESO, 3 France 

Duroc, Large White, Pietrain 

Matthes et al., 2008 

Berlin, Germany 

Duroc, Pietrain 

Wood and Lister, 1973; Moody et al., 1978 

Bristol, UK 

Large White, Pietrain 

Davies, 1974a,b,c; King et al., 1975 

Edinburgh, UK 

Landrace, Pietrain, Yorkshire 

Glodek et al., 2001 

Gottingen, Germany 

German local breeds, Pietrain 

Bout et al., 1988; Sellier et al., 1988; 

INRA, b France 

Landrace, Large White, Pietrain 

Caugant etal., 1989; Gueblez etal., 

1993a,b; Aubry et al., 2001 

Legault et al., 1987; Monin et al., 2003; 

INRA, France 

Large White, Pietrain 

Tiran et al., 2003 

Touraille et al., 1989 

INRA, France 

Jia Xiu, Large White, Meishan, 
Pietrain 

Gispert et al., 2007 

IRTA, C Spain 

Duroc, Large White, Landrace, 
Meishan, Pietrain 

Franck et al., 1998 

Lyon, France 

Duroc, Large White, Meishan, 
Pietrain 

Wolf et al., 2006 

Prague, Czech Republic 

Duroc, Hampshire, Large White, 
Landrace, Pietrain, Yorkshire 

Maurer etal., 1985; Young etal., 1989 

USMARC, d USA 

Chester White, Duroc, Hampshire, 
Landrace, Large White, Pietrain, 
Spot, Yorkshire 

Curran et al., 1972; Lean etal., 1972 

Wye College, UK 

Hampshire, Large White, 

Landrace, Pietrain 


Association pour le Developpement Agro-Environnemental du Sud-Ouest. 
b lnstitut National de la Recherche Agronomique. 
c lnstitut de Recerca i Tecnologia Agroalimentaries. 
d US Meat Animal Research Center. 

Breeding Companies 

The worldwide pork industry obtains much of 
its genetic stock from companies that provide 
seedstock and develop lines that fit into speci¬ 
fied breeding programmes (e.g. Babcock 
Genetics, Danbred, Fast Genetics, Genesus, 

Genetiporc, Hypor, Hermitage, JSR, 

Newsham Choice Genetics, PIC, TOPIGS). 

These companies develop their own lines, 


some of which are based on conventional 
purebreds, while others represent crosses of 
various breeds but have been kept closed for a 
sufficient period to be considered as independ¬ 
ent from their foundation breeds. The compa¬ 
nies generally sell a breeding programme with 
specialized sire and dam lines. The PIC 
Camborough gilt, which is a hybrid of Landrace 
and Large White, is very commonly used in 
commercial swine production throughout the 
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world. The genetic selection procedures and 
performance characteristics are generally kept 
private by the company, but there have been 
attempts by the industry to characterize the 
performance of some of the lines (Cassady etai, 
2002; Moeller et a/., 2004; Serenius et a/., 
2007). These company lines present a depar¬ 
ture from conventional thought about what 
constitutes a breed. However, the procedures 
used by the companies to preserve and 
improve their lines are very much in keeping 
with a simple modernization of the techniques 
used by purebred swine producers for many 
decades. Breeding companies continue to 
sample pigs from conventional breeds to iden¬ 
tify sources of new genetic material for their 
lines. 


Molecular Biology and Breeds 

of Pigs 

Modern molecular biology tools are now 
available to the swine industry. Genome map¬ 
ping efforts have made great progress in the 

USA (NSRF8, 2010; Rothschild, 2010; 
University of Illinois, 2010), the UK (Roslin 
Institute, 2010; Sanger Institute, 2010), 
France (INRA, 2010) and elsewhere. Other 
chapters in this book will address these 
efforts. In brief, progress is being made in 
identifying genes to assist in genetic improve¬ 
ment and aid in maintaining breed purity 
(Milan et a/., 1998; Bidanel et a/., 2000, 

2002, 2008; Feischl et a/., 2005). 

Molecular biology tools have led to 

greater understanding of domestication 
(Larson et a/., 2007) and breed diversity 
(Franceschi and Ollivier, 1981, Laval et a/., 
2000; Okumura et a/., 2001; Kim et a/., 
2002; Alves et a/., 2003; Ollivier et a/., 

2003, 2005; Yang et a/., 2003; Gongora 
et a/., 2004; Fang and Andersson, 2006; Fang 
et a/., 2006; SanCristobal et a/., 2006; Thuy 
et a/., 2006; Wu et a/., 2007; Jiang et a/., 
2008; Chang et a/., 2009; Ollivier, 2009). 
Data from the Roslin Institute Pig Biodiversity 
Project were used to construct a phylogenetic 
tree (Fig. 18.1). The tree includes one Asian 
breed (Meishan), and it clearly shows the 


WA 


considerable genetic distance between it and 
the breeds with origins in Europe. The pres¬ 
ence of the Meishan in the middle of the tree 
is surprising in view of the separate domesti¬ 
cation of pigs in Asia and Europe. However, it 
is also documented (Larson et a/., 2007) that 
Asian pigs were brought to Europe, and that 
genes from those pigs were incorporated into 
pigs which contributed to breeds that devel¬ 
oped in Europe. For example, the Berkshire 
breed has been shown to have some genetic 
elements that are similar to those in several 
Chinese breeds (Wu et a/., 2007). As this 
phylogenetic tree is examined, it is important 
to remember that when the entities in the tree 
are breeds, interpretation is different from 
when the entities are species. Speciation fre¬ 
quently occurs by a single species subdividing 
into two or more species. A phylogenetic tree 
illustrates those speciation events. Breed 
development occurs through a process of 
crossing representatives of other genetic 
stocks and the periodic insertion of genes 
from other breeds. As a result, a phylogenetic 
tree of breeds represents degrees of genetic 
similarity for the markers used in the research 
to evaluate genetic distance. However, as 
breeds draw genetic input from a variety of 
sources, a phylogenetic tree that truly repre¬ 
sents breed development would have multiple 
connections instead of single branch points. 


Summary 

The concept of a breed is likely to remain 
rather fluid. It is tempting to assume that the 
important breeds of today will continue to be 
important in the future. One has only to exam¬ 
ine the history of breeds during the 20th cen¬ 
tury, in swine and in other species, to see the 
fallacy of this assumption. The popularity of 
the Berkshire, followed by a decline in num¬ 
bers but a recent resurgence, is an example. 
Improved techniques for the identification of 
superior genetic material, including techniques 
from molecular biology, will be likely to speed 
the evolutionary pace in swine. This will mean 
even more rapid assembly and recombination 
of genetic stocks. 
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Fig. 18.1. Phylogenetic tree of breeds. Data adapted from the pig biodiversity database from the Roslin 
Institute (http://projects.roslin.ac.uk/pigbiodiv/). The phylogenetic tree was built using the program phylip 
(Felsenstein, 2004). 
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Introduction 

Genetics includes the study of genotypes, phe¬ 
notypes and the mechanisms of genetic con¬ 
trol between them. Genetic terms describe the 
processes, genes and traits with which genetic 
phenomena are described and examined. The 
genetic process terminologies are thoroughly 
discussed in the previous chapters. Therefore, 
in this chapter, we will only list the terms for 
genetic processes and concepts in Appendix I 
(a general genetic glossary), and concentrate 
the discussion on pig gene and trait terminolo¬ 
gies (Appendix II); a glossary for pig diseases 
and defects is also included (Appendix III). 

A standardized genetic nomenclature is 
vital for unambiguous concept description, 
efficient genetic data management and effec¬ 
tive communications among not only scien¬ 
tists, but also those who are involved in the pig 

©CAB International 2011. The Genetics of the Pig, 
2nd Edn (eds M.F. Rothschild and A. Ruvinsky) 


production and genetic improvement indus¬ 
tries. This issue becomes even more evident in 
the post-genomics era, owing to the rapid 
accumulation of large quantities of genetic and 
phenotypic data, and the use of computer 
software to manage such data. This imposes a 
challenge for the precise definition and inter¬ 
pretation of gene and trait terms. 

For example, Myostatin (gene MSTN) is 
also known as Growth and Differentiation 
Factor 8 (gene GDF8) (one can also find inap¬ 
propriate abbreviations such as GDF-8 in the 
literature) and is referred to as the ‘muscle 
hypertrophy’ locus in cattle. While all these 
names are interchangeably used in the litera¬ 
ture, it gets more complicated when one con¬ 
siders paralogous gene duplications across 
species, which led Rodgers et a/. (2007) to 
propose MSTN-1 and MSTN-2 as names. 
Unfortunately, this naming scheme does not 
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follow Human Gene Nomenclature Committee 
(HGNC) guidelines, which would indicate that 
they should be named MSTN1 and MSTN2. 

In terms of traits, an example that would 
benefit from consistent nomenclature is the 
longissimus dorsi muscle area, which is also 
referred to as the loin eye area (LEA), loin mus¬ 
cle area (LMA), meat area (MLD), ribeye area 
(REA), etc. Each of these is known by different 
individuals as the default name for the trait. 
Complexity is further increased by variation in 
anatomical locations, physiological stages and 
methods used to measure a given trait. This may 
seem manageable at first, but once one starts to 
compare data across different labs, publications 
or species, it quickly becomes very confusing. 

While we may not want to dictate to the 
community how genetic terms are defined, 
there are good reasons why all researchers 
need to adopt standardized genetic nomencla¬ 
tures. The emergence of the use of ontologies 
in biological research has contributed a new 
way to effectively use, standardize and manage 
genetics terms. The Gene Ontology (GO) con¬ 
sortium has provided a good example (The 
Gene Ontology Consortium, 2000). When 
genomics information must be transferred 
across species to perpetuate genetic discover¬ 
ies, the role of a standardized genetic nomen¬ 
clature becomes even more important. 

The goal of this chapter is to help estab¬ 
lish guidelines for nomenclature, with the hope 
that it will facilitate the comparison of results 
between experiments and, most importantly, 
prevent confusion. 

Locus and Gene Names and Symbols 

Locus name and symbol 

These guidelines for gene nomenclature are 
adapted and abbreviated from the HGNC Guide¬ 
lines (http: //www. genenames. org/guidelines. 
html). 

A ‘gene’ is a functional hereditary unit 
that occupies a fixed location on a chromo¬ 
some, has a specific influence on phenotype, 
and is capable of mutation to various allelic 
forms. In the absence of demonstrated function 
a gene may be characterized by sequence, tran¬ 
scription or homology. A ‘locus’, which is not 


synonymous with a gene, refers to a position in 
the genome that can be identified by a marker. 
A ‘chromosome region’ is defined as a genomic 
region that has been associated with a particu¬ 
lar syndrome or phenotype. 

Gene names and symbols will follow the 
human gene when 1:1 orthology is known. Gene 
names should be short and specific and convey 
the character or function of the gene. Gene 
names will be written using American spelling 
and contain only Latin letters or a combination of 
Latin letters and Arabic numerals. The first letter 
of a gene symbol should be the same as for the 
gene name. The symbol will consist of upper¬ 
case Latin letters and possibly Arabic numerals. 
Gene symbols must be unique. 

The locus name should be in capitalized 
Latin letters or a combination of Latin letters 
and Arabic numerals. If the locus name is two 
or more words, each word after the first word 
should be in capital Latin characters. The locus 
symbol should consist of as few Latin letters as 
possible or a combination of Latin letters and 
Arabic numerals. The characters of a symbol 
should always be capital Latin characters, and 
should begin with the initial letter of the name 
of the locus. If the locus name is two or more 
words, then the initial letters should be used in 
the locus symbol. The locus name and symbol 
should be printed in italics wherever possible; 
otherwise they should be underlined. 

When assigning gene nomenclature, the 
gene name and symbol should be assigned 
based on existing HGNC nomenclature where 
possible (e.g. 1:1 for swine:human orthologues). 
Ensembl (a joint project between EMBL-EBI 
(European Molecular Biology Laboratory- 
European Bioinformatics Institute) and the 
Wellcome Trust Sanger Institute to develop a 
software system that produces and maintains 
automatic annotation on selected eukaryotic 
genomes) has used the new EPO (Enredo, 
Pecan, Ortheus) pipeline (Paten et a/., 2008) 
for whole genome alignment of the swine 
genome. Initial efforts to provide information 
about genes predicted during the swine genome 
sequencing effort assigned standardized nomen¬ 
clature based on human gene nomenclature for 
2798 swine genes (http://www.ensembl.org/ 
Sus_scrof a/Inf o/StatsTable). 

There are two categories of novel swine 
genes: (i) novel genes predicted by bioinformatic 
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gene prediction programs; and (ii) novel swine 
genes that were studied before the completion 
of the swine genome. In cases where no strict 
1:1 human orthologue exists that has been 
assigned nomenclature, the LOC# or Ensembl 
ID should be used as a temporary gene symbol. 
In order to assign a name to a novel gene, it will 
need to be manually curated and assigned a 
unique name, following HGNC guidelines. 

Allele name and symbol 

These guidelines for allele nomenclature are 
adapted from Young (1998) and mouse 
genome nomenclature guidelines (http://www. 
inf ormatics. jax. org/mgihome/nomen/gene. 
shtml), in accordance with HGNC guidelines. 

The allele names should be as brief as pos¬ 
sible, yet still convey the variation associated 
with the allele. Alleles do not have to be named, 
but should be given symbols. If a new allele is 
similar to one that is already named, it should 
be named according to the breed, geographical 
location or population of origin. If new alleles 
are to be named for a recognized locus, they 
should conform to nomenclature established 
for that locus. The first letter of the allele name 
should be lower case. However, this does not 
apply when the allele is only a symbol. 

An allele symbol should be as brief as pos¬ 
sible and consist of Latin letters or a combina¬ 
tion of Latin letters and Arabic numerals. Like 
a gene symbol, an allele symbol should be an 
abbreviation of the allele name, and should 
start with the same letter. The allele name and 
symbol may be identical for a locus detected by 
biochemical, serological or nucleotide meth¬ 
ods. The wild-type allele can be denoted as a + 
(e.g. MSTN + ). Neither + nor - symbols should 
be used in alleles detected by biochemical, 
serological or nucleotide methods. Null alleles 
should be designated by the number zero. The 
initial letter of the symbol of the top dominant 
allele should be capitalized. All alleles that are 
codominant should have an initial capital letter. 
The initial letter of all other alleles should be 
lower case. A single nucleotide polymorphism 
(SNP) allele should be designated based on 
its dbSNP_id (http://www.ncbi.nlm.nih.gov/ 
projects/SNP/), followed by a hyphen and the 
specific nucleotide (e.g. MSTN rsl234567T ). If the 


SNP occurs outside an identified gene, the 
SNP locus can be designated using the dbSNP_ 
id as the locus symbol, and the nucleotide allelic 
variants are then superscripted as alleles (e.g. 

rsl234567 T ). 

The allele symbol should always be written 
with the locus symbol. Specifically, the allele 
symbol is written as a superscript following the 
locus symbol. For example, an SNP allele can 
be designated based on its dbSNPjd, followed 
by a hyphen and the specific nucleotide, as in 
MSTN rsl234567T . The allele symbol should be 
printed immediately adjacent to the locus sym¬ 
bol, with no gaps. The allele name and symbol 
should be printed in italics whenever possible, 
or otherwise be underlined. 


Genotype terminology 

The genotype of an individual should be shown 
by printing the relevant locus and allele symbols 
for the two homologous chromosomes con¬ 
cerned, separated by a slash, e.g. MSTN rsl234567 ~ T/ 
rsi234567~c' Unlinked loci should be separated by 

semicolons, e.g. CDll RsaI2400/220 °; ESR PvuII5700/ 
4200 . Linked or syntenic loci should be separated 
by a space or dash and listed in linkage order 
(e.g. POU1 FI^-STCH^-PRSS7 ^), or in 
alphabetical order if the linkage order is not 
known. For X-linked loci, the hemizygous case 
should have a /Y following the locus and allele 
symbol, e.g. /\R Eco57I1094 /Y. Likewise, Y-linked 
loci should be designated by /X following the 
locus and allele symbol. 

Trait and Phenotype Terminology 
Pig traits and trait ontology 

Pig traits are conventionally named based on 
their performance (e.g. average daily gain), 
physiological parameters (e.g. ovulation rate), 
anatomical locations/dissections (e.g. loin mus¬ 
cle area), physicochemical properties (e.g. 
muscle protein percentage, pH), livelihood 
soundness (e.g. immune capacity) and exterior 
appearance (e.g. coat color), etc. As such, 
there is a good chance that a trait will be named 
differently by different people, even within a 
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species community. Furthermore, traits have 
been studied across many species, which adds 
additional complexity to their naming. The 
study of the traits may also involve the study of 
underlying genes and markers, environments 
and management protocols that contribute to 
the manifestation of a trait. Therefore, it is 
obvious that factors that contribute to the nam¬ 
ing of a trait are multidimensional. As the 
amount of trait information that is associated 
with a gene or chromosomal region is growing 
exponentially, we cannot overemphasize the 
need for a standard nomenclature to be used 
by researchers to communicate as consistently 
and unambiguously as possible, with the aid of 
bioinformatics tools. 

Traits 

Fig trait terms can be found ubiquitously 
throughout journal articles, farm reports and 
daily communications among scientists and pig 
industry personnel. A trait term can be created 
by anyone, and each person may have a slightly 
different definition for said term, so hundreds 
of thousands of terms can be found in the lit¬ 
erature with various naming conventions used. 
Previously, there was no central repository 
where the uniqueness of a trait term could be 
maintained and checked, but two relatively 
recent database development efforts have now 
emerged: the Online Mendelian Inheritance in 
Animals (OMIA) database and the Animal Trait 
Ontology (ATO) database. 

OMIA was initiated in 1978. To date, it 
contains 214 pig trait variations and/or abnor¬ 
malities from pig genetic research publications 
(OMIA, 2009). ATO has a collection of over 
500 pig traits, including measurement method 
variations (ATO, 2009), of which 390 traits are 
associated with at least one quantitative trait 
locus (QTL). Appendices II and III contain a list 
of pig traits and a list of pig diseases and 
defects, respectively, which were mainly 
abridged from ATO and OMIA. In both cases, 
efforts have been made to make each database 
entry unique in terms of the names and their 
representations. ATO has been expanded to 
include trait data from cattle, pig and chicken, 
but was originally developed as part of 
the PigQTLdb (Pig QTL database, http:// 
www.animalgenome.org/QTLdb/pig.html, 
later known as Animal QTLdb when QTL data 


from multiple animal species were included), 
and the number of pig traits collected is more 
complete than that of cattle or the chicken. 
The initial purpose of ATO was to help with 
organization and management of trait informa¬ 
tion through the use of a controlled vocabulary 
to facilitate comparison of QTL experiment 
results and standardized trait data annotation 
and retrieval (Hu et a/., 2005, 2007), but it 
was later introduced to the community (Hughes 
et a/., 2008). 

‘Super-traits’ 

Compared with standard gene nomenclature, 
trait name standardization is far more complex, 
not only because the same trait can be named 
differently (e.g. ‘loin eye area’ versus ‘ribeye 
area’), but also because many factors contrib¬ 
ute to how a trait is defined under various 
circumstances. For example, Table 19.1 gives 
a list of 16 ‘backfat’ variations, each defined by 
their different measurement methods, measur¬ 
ing time and specific anatomical locations, 
which may contribute to trait comparison 
difficulties and, sometimes, confusion. 

One attempt to simplify the comparisons 
is by introducing the concept of ‘trait types’ or 
‘super-traits’ (Hu et a/., 2005). Hu et al. 
described trait type as a general physical or 
chemical property of, or the processes that 
lead to, or types of measurements that result 
in an observation (phenotype). The ‘trait 
types’ or ‘super-traits’ were initially used to 
serve as a general concept for a trait, regard¬ 
less of possible variations in the trait names by 
measurement times, locations or methods. As 
the ATO project progressed, the factors in the 
methods of trait measurements, such as point 
in time or timespan, anatomical locations, 
instruments, etc., were classified as ‘trait mod¬ 
ifiers’, because they do not constitute a com¬ 
ponent of a trait, but only affect the way a trait 
is described (authors’ unpublished data). 
Therefore, the ‘super-trait’ may only be 
employed to categorize variations in how a 
trait is defined or named. For example, ‘ribeye 
area’, ‘rib muscle area’, ‘longissimus dorsi 
muscle area’, ‘longissimus muscle area’, ‘loin 
eye area’, ‘loin muscle area’, etc. can be uni¬ 
fied as ‘longissimus dorsi muscle area (LMA 
and ‘backfat’, ‘backfat depth’, ‘backfat thick¬ 
ness’, ‘backfat above muscle dorsi’, ‘backfat 
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Table 19.1. An example of the trait variations by how they are measured and how this may 
contribute to trait comparison difficulties. 


Trait 


Measurement method 


Backfat thickness (average backfat) by ruler 
Backfat thickness (average backfat) by ultrasound 
Backfat thickness (average backfat) by Fat-O-Meter 
Backfat at first rib (first rib backfat) 

Backfat at first rib (measured at 14 weeks of age) 
Backfat at first rib (measured at 26 weeks of age) 
Backfat at last rib (measured at 14 weeks of age) 
Backfat at last rib (measured at 26 weeks of age) 
Backfat thickness at last rib 
Backfat at shoulder 
Backfat at 10th rib 
Backfat between 3rd and 4th rib 
Backfat between 6th and 7th rib 
Backfat depth at maximum muscle depth 
Backfat depth at last lumbar 
Backfat weight (dissected total weight) 



By methods 


By time 


By location 


intercept’, ‘backfat linear’, etc. can all simply 
be 'subcutaneous fat thickness ’. 

Trait hierarchy and ontology 

In order to compare QTLs across experiments, 
the PigQTLdb uses a trait hierarchy (Table 19.2) 
to provide a framework for organizing the traits 
and easily locating them (Hu et a/., 2005). 
This approach simplifies the procedures by 
which traits are defined, linked and compared. 
Subsequently, a computer program could be 
implemented to automatically process the data¬ 
base searches so that, when a user queries for 
a trait by keywords, the database can gather 
and retrieve related trait names and their asso¬ 
ciated QTLs, put them together and present 
them to the user in real time. 

However, people of different disciplines 
may see the need for a different trait hierarchy, 
which may better capture the subtleties required 
in their field. For example, for the body weight 
gained over a period of time (e.g. average daily 
gain, or ADG, a measure of body weight gain), 
a farmer considers it a production trait, a nutri¬ 
tionist may see it as an indicator for feed con¬ 
version efficiency and a veterinarian may find 
it a health status parameter. Similarly, blood 
cholesterol levels may be used to predict pork 
quality by pork producers, and may also be a 
parameter to predict coronary heart disease by 
those who use the pig as an animal model for 


Table 19.2. A simple pig trait class hierarchy used 
in PigQTLdb (Pig quantitative trait locus database). 


1. Meat quality 

1.1 

Fatness 


1.2 

Meat color 


1.3 

pH 


1.4 

Anatomy 


1.5 

Conductivity 


1.6 

Enzyme activity 


1.7 

Stiffening 


1.8 

Texture 


1.9 

Chemical 


1.10 

Flavor 


1.11 

Odor 


1.12 

Fat composition 

2. Production 

2.1 

Growth 


2.2 

Feed efficiency 


2.3 

Digestive organs 

3. Health 

3.1 

Pathogen 


3.2 

Immune capacity 


3.3 

Disease resistance 


3.4 

Blood parameters 

4. Exterior 

4.1 

Behavioural 


4.2 

Coat color 


4.3 

Conformation 


4.4 

Age 


4.5 

Defects 

5. Reproduction 

5.1 

Litter size 


5.2 

Reproductive organs 


5.3 

Reproductive traits 


5.4 

Endocrine 


human heart disease research. Therefore, a 
simple hierarchy may be helpful to solve prob¬ 
lems in some cases, but may be inadequate in 
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other cases. In addition, owing to the existence 
of multiple overlapping hierarchies for pig 
traits, the management of such data may be 
difficult. Thankfully, progress is being made to 
resolve these issues. 

Ontologies use controlled vocabularies to 
describe objects and the relationships between 
them in a formal manner. In an ontology, the 
Directed Acyclic Graph (DAG), a mathematical 
graphic modelling method, is used to solve 
data management problems with complex 
hierarchical structure. In the example shown in 
Fig. 19.1, the trait ‘marbling’ is shown to 
belong to the ‘meat quality’, ‘adipose trait’ and 
‘muscular system physiology’ hierarchies. 
Figure 19.2 shows how these traits are linked 
by their respective hierarchy paths. Computer 
tools have been developed and are freely avail¬ 
able to manage ontology data with DAG struc¬ 
tures. The two most popular tools, which are 
likely to be useful to the pig genetics commu¬ 
nity, are AmiGO and OBO-Edit (Gene Ontology 
Tools, 2009). AmiGO is an ontology browser 
adapted to the ATO database, which allows 
users to share and view trait data stored in ATO 
with any web browser on the Internet (ATO, 
2009). OBO-Edit is a Java-based ontology data 
editor, which can be used by anyone to edit 
ontology term definitions and relationships. 

ATO has been a successful project since its 
development from the QTLdb several years ago. 
Recently, the developers of ATO have begun 
working with Mouse Genome Informatics, the 
Rat Genome Database, the European Animal 
Disease Genomics Network of Excellence 
(EADGENE) and the French National Institute 
for Agricultural Research (INRA) to incorporate 
Mammalian Phenotype (MP) ontology and ATO 
into a unified Vertebrate Trait (VT) Ontology 
(2009). This project is aimed at enhancing the 
ability to standardize trait nomenclature within 
and across species. For example, a disease such 
as Atrophic rhinitis may have been considered 
a ‘trait’ in classical animal genetic studies. In fact, 
in terms of concept specifications, it is not a 
characteristic pig trait observable in the general 
population, but rather a morphological abnor¬ 
mality caused by a disease only in some pigs. In 
addition, a trait name may have variations 
because it is ‘modified’ by measurement time or 
method (Table 19.1), but these are actually the 
same trait. The separation of diseases from traits 
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product trait 
carcass yield 
egg morphology 
egg production rate 
meat quality 
© meat color 
© meat composition 
© meat ash level 
© meat moisture level 
© muscle lipid level 

© intramuscular fat percentage 
© intramuscular fat weight 
© marbling 


muscle cholesterol level 
© muscle protein level 
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□ 4 adipose tissue morphology 
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0 © fat distribution 
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0 © subcutaneous fat 
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0 4 muscular system trait 

0 4— i i muscular system morphology 
0 4- l muscular system physiology 

© adenosine deamination 
© muscle androstenone level 
0 © muscle conductivity 

0 © muscle contractility 

0 © muscle glycogen homeostasis 

© muscle indole level 
© muscle lactate level 
0 © muscle lipid level 

© intramuscular fat percentage 
© intramuscular fat weight 
© marbling 


Fig. 19.1. Example of how traits are organized in 
the Animal Trait Ontology (ATO) database. A trait 
term, for example marbling, may belong to multiple 
trait class hierarchies, as shown in an AmiGO 
browser (http://www.animalgenome.org/cgi-bin/ 
amigo/browse.cgi). 


(Appendices II and III) reflects the efforts towards 
a well-defined and standardized trait nomencla¬ 
ture. Standardization of the trait nomenclature 
will undoubtedly help the pig genome commu¬ 
nity make meaningful trait comparisons, and 
also facilitate the transfer of genomics informa¬ 
tion from some frequently studied species. The 
challenge of using ontologies to standardize 
and manage trait nomenclature is not only a 
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Fig. 19.2. An OBO-Edit graphic view showing that multiple relationship paths may exist between a child 
term and its ancestral term. The symbol T means that the relationship type is 7s_a’; The symbol ‘P’ means 
that the relationship type is ‘part_of\ 


technical issue, but a community issue, in the 
sense that it has to be commonly recognized, 
mutually agreed upon and widely shared. 


Trait nomenclature recommendations 

Currently, pig traits are named in many different 
ways, ranging from the very abbreviated to 
other names that are verbally lengthy and self- 
descriptive. We suggest a trait name be chosen 
to represent that which is measured on an indi¬ 
vidual animal. We have compiled and edited 
known pig traits into a list in Appendix II. We 
strongly recommend users to refer to the special¬ 
ized online databases for the most up-to-date 
reference, as far as the use of trait terms is con¬ 
cerned. While it might not be realistic to enforce 
a naming guideline without a community consen¬ 
sus, we recommend the following steps to take 
or factors to consider when naming a trait: 

1. Check against existing databases and litera¬ 
ture, to make sure a trait term does not previ¬ 
ously exist. Good places to start are online 
databases such as ATO, OMIA, AGRICOLA 
and PubMed. Checking journals in the field 
(e.g. Journal of Animal Science ), and in 
closely related fields (e.g. Meat Science ), is 
essential too. Sometimes, reviewing similar 
names also helps to improve ideas. 


2. Try to be informative. Although more 
details can be given in the ‘description’ of the 
trait, the name itself should already carry most 
of the conceptual information of that trait. 

3. Try to be brief. One or a few key nouns, or 
a short noun phrase, may be good candidates. 
Sentences should be avoided (i.e. no verb). 
Generally, the name should be short, but do 
not abbreviate so much as to lose necessary 
information. 

4. The name should be neutral. It should rep¬ 
resent a concept, not a type or a part of the 
concept. 

5. Avoid adjectives, adverbs and excessive 
symbols in the name. 

6. If similar trait terms exist in another spe¬ 
cies, consider adapting them for use in pigs. 
This will help with future cross-species data 
comparisons and in bringing comparative 
information from other species. 

7. Consider a good abbreviation for the name. 
Making it phonically easy to pronounce and 
easy to remember will make it more likely to be 
well used. 

8. The name should be in English, using the 
American spelling. The abbreviations should 
be in capital Latin letters. If an original trait 
term used in a study is not English, its English 
equivalent is needed for publications. 

9. Lastly, check for possible name conflicts to 
avoid confusion. 
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Phenotype nomenclature 
recommendations 

A phenotype is the actual manifestation of 
observable traits. It is a trait observed in an 
individual. It usually consists of a trait with 
characteristic features (e.g. a litter size of nine 
piglets), variations that can be described (e.g. 
black spots on the body) or qualities that can 
be measured (e.g. birth weight of 1.51b). 
Because there are so many variations in how a 
phenotype can be ‘observed’ (often such 
observation is made indirectly with instruments 
or through tests) and obtained, a technical 
guide for recording each trait might be ideal. 
None the less, we recommend that the follow¬ 
ing components are included for a complete 
phenotype record: 

1. What is the observation for? This is nor¬ 
mally the trait type or super-trait name, such as 
ADG or carcass weight. 

2. How is the observation made? This refers 
to the methods used to measure or make the 
observation (e.g. balance, ruler, ultrasound 
machine, scores, etc.). 

3. How are the observational data processed? 
This is the calculation necessary to derive the 
results for recording (e.g. calculation of ADG, 
conversion of ultrasound value to length). 

4. What is the unit of the observed data (e.g. 
inch or centimetre for length, pound or gram for 
weight, percentage or ratio for fractions, etc.)? 

In addition, a description of comments for a 
phenotype record may be necessary for the 
data users to correctly understand and use the 
data. For example, when blood samples are 
taken, the number of hours the pig is fasted 
might be an important cofactor for the blood 
cholesterol concentration measured. 

When a phenotype is a reflection of certain 
genotypes, the phenotype symbol should be in 
the same characters as the genotype and allele 
symbols. The difference is that the characters 
should not be underlined or in italics, and they 
should be written with a space between locus 
characters and allele characters, instead of an 
asterisk. Square brackets, [ ], may also be used. 

In classical genetics, phenotypes are 
often used to denote Mendelian genotypes. 
This is done using an abbreviation of the trait, 


followed by a superscripted plus (+) or minus 
(-) sign to represent ‘presence’ or ‘absence’ of 
certain trait features. For example, halothane- 
negative is denoted as ‘Hal - ’ and halothane- 
positive as ‘Hal + ’. A phenotype denotation can 
also be used to represent genetic haplotypes, 
such that ‘K88ab + , ac + , ad - ’ are written 
together as an entire denotation. Likewise, 
numbers or letters may be used to denote 
alleles when polymorphisms are observed, for 
example, ApoB 1/2 , ApoB 273 , etc. (note the dif¬ 
ference from recording genotypes, where ital¬ 
ics or asterisks are required). 

Future Prospects 

The Gene Ontology and Mammalian Phenotype 
Ontology are already playing a role in robust 
annotation of mammalian genes and pheno¬ 
types in the context of mutations, quantitative 
trait loci, etc. (Smith eta/., 2005). Undoubtedly, 
a standardized pig genetic nomenclature will 
more effectively facilitate efficient pig genome 
annotation and transfer of knowledge from 
information-rich species such as humans and 
the mouse, and make it possible for new bio¬ 
informatics tools to easily streamline data man¬ 
agement and genetic analysis. 

Several genome databases, such as ArkDB 
(http://www.thearkdb.org/), Animal QTLdb 
(http://www.animalgenome.org/QTLdb), 
Ensembl (http://www.ensembl.org/) and NCBI 
GeneDB (http://www. ncbi. nlm. nih. gov/gene), 
have played a role in the usage of commonly 
accepted gene/trait notations. Undoubtedly, 
existing and new genome databases and tools 
will further develop and evolve. As such, a 
standardized genetic nomenclature in pigs will 
definitely become crucial for information shar¬ 
ing and comparisons between different research 
groups, across experiments and even across 
species. 
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Appendix I Genetic Glossary 

Bold words are glossary entries. Italicized words are concepts that may be independent glos¬ 
sary entries as well. 

Adaptation traits - Adaptation traits contribute to individual fitness and to the evolution of animal genetic 
resources. By definition, these traits are also important to the ability of the animal genetic resource to 
be sustained in the production environment. 

Additive genetic effects - The effect of an allele on animal performance, independent of the effect of the 
other allele at a locus. These effects of the two alleles at a locus add up (thus ‘additive’). Alleles at a 
locus may have other effects (dominance, epistasis), so that there are not genes that have just ‘addi¬ 
tive’ effects and other genes with only ‘dominance’ effects. Additive genetic effects can be inherited; 
other genetic effects such as dominance and epistasis are the result of allele combinations that are 
lost between generations. The additive genetic effect that an animal has for a trait is equal to its breed¬ 
ing value. 

Allele - One of a pair, or series of alternative forms, of a gene that can occur at a given locus on homolo¬ 
gous chromosomes. 

Amino acids - Any one of a class of organic compounds containing the amino (NH 2 ) group and the car¬ 
boxyl (COOH) group. Amino acids are combined to form proteins. 

Ancestor - Any individual from which an animal is descended. 

Animal model - A system for genetic evaluations that estimates breeding values of individual animals 
(males, females) at the same time. The system uses production data on all known relatives in calculat¬ 
ing a genetic evaluation. 

Assortative mating - Assigning animals as mates based on phenotypic or genetic likeness. Positive 
assortative mating is mating animals that are more similar than average. Negative assortative mating 
is mating animals that are less similar than average. 

Autosome - Any chromosome that is not a sex chromosome. 

Backcross - The cross produced by mating a first-cross animal back to one of its parent lines or breeds. 

Best linear unbiased prediction (BLUP) - A method of genetic prediction (with the properties of smallest 
variance, linear and unbiasedness) that is particularly appropriate when performance data come from 
genetically diverse contemporary groups. 

Breed - Either a subspecific group of domestic livestock with definable and identifiable external character¬ 
istics that enable it to be separated by visual appraisal from other similarly defined groups within the 
same species, or a group for which geographical and/or cultural separation from phenotypically simi¬ 
lar groups has led to acceptance of its separate identity. 

Breeding value - The mean genetic value of an individual as a parent. It can be estimated as the average 
superiority of an individual’s progeny relative to all other progeny under conditions of random mating. 

Categorical trait - Scores are given usually in a few categories up to several categories (e.g. scores 1-5 
for leg movement). 

Centromere - Spindle-fibre attachment region of a chromosome. 

Chromosome - Microscopically observable linear arrangement of DNA in the nucleus of a cell. 
Chromosomes carry the genes responsible for the determination and transmission of hereditary 
characteristics. 

Codominant alleles - Alleles, each of which produces an independent effect in heterozygotes. 

Combining ability - The mean performance of a line when involved in a crossbreeding system. General 
combining ability is the average performance when a breed or line is crossed with two or more other 
breeds or lines. Specific combining ability is the degree to which the performance of a specific cross 
deviates from the average general combining ability of two lines. 

Composite (synthetic) breed - A hybrid with at least two, and typically more, breeds in its background. 
Composites are expected to be bred to their own kind, retaining a level of hybrid vigour normally 
associated with traditional crossbreeding systems. 

Control line - A line that is randomly selected and randomly mated. Usually used in selection experiments 
to monitor environment effects in order to estimate genetic change in a selected line. 

Correlation coefficient - A measure of the interdependence of two random variables that ranges in value 
from -1 to +1, indicating perfect negative correlation at -1, absence of correlation at zero, and perfect 
positive correlation at +1. It determines the degree to which the movements of two variables are asso¬ 
ciated. No cause and effect is implied. 
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Covariance - The degree to which two measurements vary together. A positive covariance is when two 
measurements tend to increase together. A negative covariance is when one measurement increases 
and the other measurement tends to decrease. 

Crossbreeding - Matings between animals of different breeds or lines. 

Crossover -The process during meiosis when chromosomal segments from different members of a homo¬ 
logous pair of chromosomes break, and part of one will join a part of the other, so that two gametes 
form possessing new combinations of genes. The frequency of crossover between two loci is propor¬ 
tional to the physical distance between them. 

Crossover unit - Each unit is equal to a 1% frequency of crossover gametes. 

Cytoplasm - The protoplasm outside a cell nucleus. 

Descendant - An individual descended from other individuals. 

Diallel cross - When both males and females from each breed (or line) in a set of breeds (or lines) are 
mated to males and females of each breed (or line) in the set including their own breed (or line). 

DNA - Deoxyribonucleic acid, the chemical material that carries information to code for a gene. 

Dominant - Applied to one member of an allelic pair of genes, which has the ability to express itself wholly 
or largely at the exclusion of the expression of the other allele. 

Dominance genetic effects - The effect that an allele has on animal performance, which depends upon 
the genotype at the locus. For example, the ‘a’ allele may have a different effect on animal performance 
in ‘aa’ animals than in ‘Aa’ animals. See additive genetic effects. 

Economic trait loci - Loci that have effects on traits of economic importance. 

Economic value - A measure of the contribution an individual trait makes to the overall economic value of 
an animal. 

Environment - The aggregate of all the external conditions and influences affecting the life and development 
of the organism. 

Environmental correlation - When two traits tend to change in association with each other as a result of 
environmental effects. 

Environmental variance - Variation in phenotype that results from variation in environmental effects. 

Epistasis - When the gene at one locus affects the expression of the gene at another locus. 

Estimated breeding value - A prediction of a breeding value. See breeding value. 

F 1 - Animals resulting from crossing parents from different lines or breeds. 

F 2 - Animals resulting from matings among F 1 parents. 

F 3 - Animals resulting from matings among F 2 parents. 

Family size - The mean number of offspring per parent that successfully reproduce. 

Full sibs - Individuals having the same male and female parents. 

Gamete - A sperm or egg cell containing the haploid (1 n) number of chromosomes. 

Gene - A functional hereditary unit that occupies a fixed location on a chromosome, has a specific influence 
on phenotype and is capable of mutation to various allelic forms. 

Generation interval - The average age of the parents when the progeny that will replace them are 
born. 

Genetic correlation - When two traits tend to change in the same or opposite directions as a result of 
genetic effects. 

Genetic distances - A measure of gene differences between populations (hence genetic relationships 
among them) described by some numerical quantity; the gene differences are usually referred to as 
measured by a function of gene frequencies. 

Genetic drift - Changes in gene frequency in small breeding populations due to chance fluctuations. 

Genetic gain - The amount of increase in performance that is achieved through genetic selection after one 
generation of selection. 

Genetic maps - See linkage map. 

Genetic marker - A gene or DNA sequence having a known location on a chromosome and associated 
with a particular gene or trait; a gene phenotypically associated with a particular, easily identified trait 
and used to identify an individual or cell carrying that gene. 

Genetic variance - Variation in phenotype that results from variations in genetic composition among 
individuals. 

Genome - The complete set of genes and non-coding sequences present in each cell of an organism, or 
the genes in a complete haploid set of chromosomes of a particular organism. 

Genotype - The genetic constitution of one or a few gene( s) or locus (loci), or total genetic make-up 
(genes) of an individual organism. 
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Genotype-environment interaction - When the difference in performance between two genotypes differs, 
depending upon the environment in which performance is measured. This may be a change in the 
magnitude of the difference or a change in rank of the genotypes. 

Germplasm - The germinal material or physical basis of heredity; the sum total of the genes. 

Grade-up - The process of repeated backcrossing to one parental line to produce a population that is 
nearly purebred. 

Half sibs - Individuals that share only one common parent. 

Haplotype - A set of alleles at a closely linked group of loci, so closely linked that the allelic set behaves 
almost as one allele in terms of inheritance. 

Hardy-Weinberg law - A population is in genotypic equilibrium if p and q are the frequencies of alleles 
A and a, respectively, and p 2 , 2 pq and cf are the genotypic frequencies of AA, Aa and aa under the 
condition of random mating. 

Heritability - Degree to which a given trait is controlled by inheritance; proportion of total phenotypic variation 
that is attributable to genetic variation (in contrast to environment-caused variation). 

Heterosis - The degree to which the performance of a crossbred animal is better or worse than the average 
performance of the parents. 

Heterozygote, adj. heterozygous - An organism with unlike members of any given pair or series of alleles, 
which consequently produces unlike gametes. 

Homologous chromosomes - Chromosomes that occur in pairs and are similar in size and shape, one 
having come from the male and one from the female parent. 

Homozygote, adj. homozygous - An organism whose chromosomes carry identical members of a given 
pair of genes. The gametes are therefore all alike with respect to this locus. 

Inbreeding - Matings among related individuals, which result in progeny that have less heterozygosity and 
hence more homozygous gene pairs than the average of the population. 

Inbreeding coefficient - A measurement of the increase in homozygosity, each unit is equal to a 1% 
increase in homozygosity relative to the average homozygosity in the base population. 

Inbreeding depression - The decreased performance normally associated with accumulation of inbreed¬ 
ing. Many recessive genes result in undesired traits or decreased performance when they are 
expressed. Inbred animals have more recessive genes in the homozygous condition that are expressed 
and result in reduced performance or undesired traits. 

Independent culling - When animals are culled if they do not meet all of the minimum levels of performance 
for a set of traits. 

Introgression - A breeding strategy for transferring specific favourable alleles from a donor population to 
a recipient population. This would, for example, be of great interest for genes responsible for disease 
resistance, which could be introgressed into a susceptible, but otherwise economically superior, 
breed. 

Karyotype - The appearance of the metaphase chromosomes of an individual or species, which shows 
the comparative size, shape and morphology of the different chromosomes. 

Lethal gene - A gene that results in the death of the animal. 

Liability - Both internal (e.g. genetic merit) and external (e.g. nutrition, disease, exposure) forces that influence 
the expression of a threshold character (e.g. disease, conception, abnormalities, etc.). 

Line-breeding - Mating of selected individuals from successive generations to produce animals with a 
high relationship to one or more selected ancestors. It is a mild form of inbreeding. 

Linkage - Association of genes physically located on the same chromosome. A group of linked genes is 
called a linkage group. 

Linkage map - A linear map of an experimental population that shows the position of its known genes and/ 
or genetic markers relative to each other in terms of recombination frequency. 

Locus, pi. loci - A fixed position on a chromosome occupied by a given gene or one of its alleles. 

Major gene - A gene that has an easily recognizable and measurable effect on a characteristic. 

Marker - Specific and identifiable sequences of the DNA molecule. These markers may or may not be 
functional genes. 

Marker-assisted selection (MAS) - Selection for specific alleles using genetic markers. 

Maternal heterosis - The advantage of the crossbred mother over the average of purebred mothers. 

Mating systems - The rules that describe how selected breeds and/or individuals will be paired at 
mating. 

Meiosis - The process by which the chromosome number of a reproductive cell becomes reduced to half 
the diploid (2 n) or somatic number and results in the formation of eggs or sperm. 


Standard Genetic Nomenclature of the Pig 


485 


Migration - Movement of animals, and consequently genes, from one population to another. 

Mitochondria - Small bodies in the cytoplasm of most plant and animal cells that are responsible for 
energy production. 

Mitosis - Cell division process in which there is first a duplication of chromosomes, followed by migration 
of chromosomes to the ends of the spindle and a dividing of the cytoplasm, resulting in the formation 
of two cells with a diploid (2 n) number of chromosomes. 

Molecular genetics - The branch of genetic studies that deals with hereditary transmission and variation 
on the molecular level. It deals with the expression of genes by studying the DNA sequences of 
chromosomes. 

Multiple alleles - Three or more alternative forms of a gene representing the same locus in a given pair of 
chromosomes. 

Mutation - A sudden change in the genotype of an organism. The term is most often used in reference to 
point mutations (changes in base sequence within a gene), but can refer to chromosomal changes. 

Natural selection - Natural processes favouring reproduction by individuals that are better adapted, and 
tending to eliminate those less adapted to their environment. 

Nucleus - Part of a cell containing chromosomes and surrounded by cytoplasm. 

Outcrossing - Mating of individuals that are less closely related than the average of the population. 

Overdominance - A form of dominance where the performance of the heterozygote exceeds that of the 
best homozygote. 

Partial dominance - A form of dominance where the performance of the heterozygote is intermediate 
between the two homozygotes, but more closely resembles the performance of the homozygous 
dominant type. 

Pedigree - Usually refers to a pedigree chart or what a pedigree chart represents in genetics. It is a docu¬ 
ment to record the ancestry of an individual. A pedigree can also be used to illustrate the family 
structure or breeding scheme. 

Penetrance - The proportion of the individuals with a particular gene combination that express the 
corresponding trait. 

Permanent environmental effects - Environmental effects that result in permanent effects on the phenotypic 
expression of a trait. For example, severe mastitis during lactation may have a permanent effect on milk 
production and litter weaning weight for an animal in subsequent litters. 

Phenotype - Actual exhibit of observable traits. Normally, it refers to characteristics of an individual such 
as size, shape, colour or performance. 

Phenotypic correlation - When two traits tend to change in the same or different direction as a net result 
of genetic and environmental effects. 

Phenotypic value - A performance record; a measure of an animal’s performance for a trait. 

Phenotypic variation - Variation in phenotype that results from variation in genetic and environmental 
effects on the individuals. 

Pleiotropy - The property of a gene whereby it affects two or more characters, so that, if the gene is 
segregating, it causes simultaneous variation in the characters it affects. 

Polymorphism - Where DNA or genes have more than two forms or alleles in the population. 

Population - Entire group of organisms of a kind that interbreed. 

Population genetics - The branch of genetics that deals with frequencies of alleles in groups of 
individuals. 

Progeny - Offspring or individuals resulting from specific matings. 

Progeny test - A test used to help predict an individual’s breeding values, involving multiple matings of 
that individual and evaluation of its offspring. 

Protein - Any of a group of complex nitrogenous organic compounds that contain amino acids as their basic 
structural units, occur in all living matter and are essential for the growth and repair of animal tissue. 

Qualitative trait - A trait that can generally be classified into a limited number of categories, and the animal 
can be said to ‘possess’ the quality or not. Examples include hair colour, skin colour and ear stature. 

Quantitative trait - A trait that is represented by an almost continuous distribution of measurements. 
Examples include average daily gain, backfat thickness and height. 

Quantitative trait locus (QTL) - A locus that affects a quantitative trait. 

Random mating - A mating system in which animals are assigned as breeding pairs at random, without 
regard to genetic relationship or performance. 

Recessive - Applies to one member of an allelic pair that lacks the ability to manifest itself when the other, 
dominant member is present. 
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Reciprocal cross - A breeding scheme where males of breed A are mated to females of breed B and 
males of breed B are mated to females of breed A. 

Reciprocal recurrent selection - A method of selection for combining ability or heterosis. Selection within 
two lines is based on the performance of crossbred progeny produced by crossing the two lines. 

Recombination - The observed new combinations of DNA segments, or loci, or traits, which are different 
from those combinations exhibited by the parents. 

Recurrent selection - A method of selection for combining ability or heterosis. Selection within one line is 
based on performance of crossbred progeny from matings with a ‘tester’ line. 

Repeatability - The proportion of total phenotypic variation that is attributable to variations caused by 
genetic and permanent environmental effects. It is a measure of the degree to which early measures 
of a trait can predict later records of the same trait. 

RNA - Ribonucleic acid, involved in the transcription of genetic information from DNA. 

Segregation - The separation of paired alleles at loci during germ cell formation. 

Selection - Any natural or artificial process favouring the survival and propagation of certain individuals in 
a population. 

Selection criteria - The character(s) upon which selection decisions are based, with the intent of changing 
the character(s) in the selection objective. 

Selection differential - The difference in mean performance of the selected group of animals relative to 
the mean performance of all animals available for selection. 

Selection index - The combining of measurements from several sources into an estimate of genetic value; 
when more than one measurement on a trait, and/or measurements of the trait on relatives, and/or 
measurements of more than one trait are combined into a single estimate of overall genetic value. 

Selection intensity - The proportion of animals selected to be parents relative to the total number available 
for selection. The smaller the proportion selected, the higher the selection intensity. 

Selection objective - The character(s) that are intended to be modified by selection. 

Sex chromosomes - The X orY chromosomes. 

Sex-influenced - Traits for which the expression depends on the sex of the individual. 

Sex-limited - A trait that can be expressed only in one sex, such as milk production. 

Sex-linked - Genes that are located on the sex (X orY) chromosomes. 

Synthetic breed - See composite breed. 

Zygote - The cell produced by the union of mature gametes (egg and sperm) in reproduction. 



Standard Genetic Nomenclature of the Pig 


487 


Appendix II Pig Trait Glossary 

The pig traits are mainly abridged from the Animal Trait Ontology (ATO) database (http://www. 

animalgenome.org/bioinfo/projects/ATO/). 

Abdominal fat (ABDF) - Abdominal fat measured by weight and/or converted to percentage. 

ACTH basal level (ACTH1) - Level of adrenocorticotrophic hormone produced by the anterior lobe of the 
pituitary gland that stimulates the secretion of cortisone and other hormones by the adrenal cortex. 

Age (AGE) - Age at certain physiological or production stage. Often measured at times of farrowing 
(AGELF), puberty (AGEP), slaughter (ENDAGE) or other times. 

Androstenone, laboratory (ANDR) - Laboratory measurement of androstenone concentration. 

Anti-K88 E. coli IgG level (K88Ab) - Laboratory detected or quantified anti-K88 antibody level (IgG) to 
Escherichia coli. Also denoted as IgG levels to K88 antigen (IGG2K88). 

Anti-0149 E. coli IgG level (0149AbR) - Laboratory detected or quantified anti-0149 antibody level 
(IgG). Also denoted as IgG levels to E. coli 0149 (IGG20149). 

Aspartate amino transferase activity (AST) - Laboratory measurement of AST enzymatic activity. 

Average chain length (FA-ACL) - Average number of carbon atoms comprising the fatty acids in a fat 
sample. 

Average daily feed intake (FEEDIN) - Daily feed intake by a pig averaged over a period of time. 

Average daily gain (ADG) - Body weight gain measured during various time points, e.g. birth to weaning, 
weaning to marketing (also called fattening ADG), or specific test periods, such as between 0 and 3, 
4, 5, 7, 10, 18 or 26 weeks of age, etc. 

Average glycolytic potential (GLYPO) - An estimator of resting glycogen content measured in vivo on 
porcine muscle. 

Average lactate (LACT) - Laboratory measurement of lactate concentration in muscle. 

Backfat thickness (BF) - Thickness of the layer of fat that lies under the skin along the back of the pig. The 
measurement may vary depending on the method (e.g. Fat-O-Meter, ultrasound, ruler), anatomical 
location for taking the measure, the age and body weight (both represent physiological stage of the 
animal), etc. Also called fatback, subcutaneous fat thickness (SCFT). 

Backfat weight (BFW) - Backfat amount, measured by dissected weight. May also be converted to a 
percentage (BFP). 

Band-formed neutrophil number (BFNEUT) - Number of immature neutrophils undergoing granulopoiesis. 
Nuclei are unsegmented. 

Basal cortisol level (CORT) - Blood cortisol concentration in the absence of external intervention. 

Basal glucose level (GLU) - Blood glucose concentration in the absence of external intervention. 

Base excess (BASEEX) - Refers to the amount of acid required to return the blood pH of an individual to 
the reference interval (pH 7.35-7.45), with the amount of carbon dioxide held at a standard value. 

Basophil number (BASO) - Number of basophilic granulocytes. 

Belly fat area (BFA) - The belly fat size as measured by area. 

Belly meat content (BYLEAN) - Meat content measured by weight and/or converted to percentage (ratio). 

Belly weight (BELLYWT) - Weight of belly; an indirect indication of fat deposition. 

Bilirubin level (BILIR) - Level of this waste product formed from old red blood cells; used as an indicator 
of carcass quality. 

Blood pH (BLPH) - pH measured from blood samples. 

Body length (BODYL) - Measured from the tail head to the point of the shoulder when the head is down. 

Body mass index (BMI) - A number calculated from weight and height, as an indicator of body conformation. 

Body weight (BW) - Live body weight measured at various time points, for example, at birth (birth weight, 
BW), at weaning (weaning weight, WWT), at slaughter (slaughter weight, SWT), etc. 

Bone weight (BONEWT) - Total bone weight in carcass. May also be expressed as a percentage (BP). 

Calcium level (BCAL) - Laboratory measurement of blood calcium concentration. 

Carbon dioxide level (C0 2 ) - Laboratory measurement of blood carbon dioxide level. 

Carcass fat-free weight (CWNOFAT) - Carcass weight minus weight of fat. Also referred to as carcass 
lean weight. 

Carcass length (CRCL) - Length of carcass, measured post-mortem. 

Carcass temperature (CARTEMP) - Usually measured at 45 min or 24 h post-mortem. 

Carcass traits - Criteria used in assessing the quality of a carcass. Important in determining the price, 
suitability of breeding programme and value of sire. This may include length, weight, proportion of fat 
and lean, distribution of fat, relative size of valuable cuts, etc. 
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Carcass weight (CWT or HCWT) - Carcass weight measured either right after slaughter (HCWT) or 24 h 
after the slaughter (CWT). 

Carcass width, minimum (CRCW) - Width of the carcass measured at the narrowest point. 

Carcass yield (CYIELD) - Proportion of the pig’s live weight salvaged at the point the carcass is har¬ 
vested. Also called dressed weight, dressing percentage, carcass dressing or killing out 
percentage. 

Chew score (CHEW) - A subjective score for chewiness as judged by a human on a taste panel. Also 
called chewiness score (CHEW). 

Cholesterol (CHOL) - Blood cholesterol concentration, usually in mg/dl. 

Chroma (CHROMA) - A measure of meat colour intensity, calculated based on a/a* and b/b* colour 
coordinates. 

C0 2 partial pressure (PC02) - A blood parameter to describe haemoglobin oxygen-binding capacity as 
an indicator of health status. (A rise in the partial pressure of C0 2 or a lower pH will cause offloading 
of oxygen from haemoglobin.) 

Coat color (CC) - Colour of the coat/hair. 

Color density (COLORD) - Total colour density derived from the carcass image analysis. 

ConA-induced cell proliferation (ConA) - Measurement of lymphocyte proliferation upon stimulation 
with concanavalin A. An indicator of immune capacity. 

Conductivity post-mortem (COND) - Conductivity in muscle, typically measured at 45 min or 24h 
post-mortem. 

Conformation - Visual or measurable variation in body shape or proportions. Often assessed with a sub¬ 
jective scoring system. 

Congenital - A condition present at birth but not necessarily hereditary. 

Cooking loss (COOKL) - Loss by weight during the cooking process. 

Cooling loss (CLOSS) - Loss by weight during the post-mortem carcass cooling process. 

Cortisol level variations (dCORT) - Differences in blood cortisol levels. It is an indirect indicator of stress 
response. 

Creatine kinase level (CK) - Measurement of creatine kinase concentration, typically as a stress response 
parameter. May also be referred to as creatine phosphokinase (CPK). 

Creatinine level (CREAT) - Creatinine concentration, used as an alternative screening test for an abnormal 
hyperpyrexia response to halothane. 

Diameter of muscle fibers - Average diameter of myofibres. Can include all fibre types, or specifically 
angular, giant, type I, type lla, or type lib (white; WFIBD) muscle fibres. 

Diaphragm weight (DIAWT) - Dissected weight of the diaphragm. 

Double-bond index (FA-DBI) - An assessment of the number of double bonds in a fat sample. Serves as 
an indicator of fat quality. 

Dressing percentage (DRESSP) - See carcass yield. 

Drip loss (DRIPL) - Fluid loss from fresh meat due to passive exudation. Also known as purge loss 
(PURGEL). 

Ear erectness (EARER) - Determination of whether an animal’s ears flop down, stand straight up or are 
intermediate; usually scored. 

Ear size (EARSZ) - Determination of the outer ear size. 

Eosinophil number (EOS) - Number of one of the types of immune cells responsible for combating infection 
and parasites. 

Epididymis weight (EPIDW) - Weight of the epididymis by dissection. 

Estimated carcass lean content (ECLC) - Estimated lean meat content of the carcass. 

Estrone, laboratory (ESTR) - Laboratory determination of oestrone concentration. 

Exploration during stress (EXPL) - Time spent rooting, sniffing the floor or the sides of the test arena 
during test period. 

External fat (EFAT) - External fat measured on ham, loin or shoulder (shoulder fat thickness, subcuta¬ 
neous fat depth at shoulder). 

Fat area (FATAREA) - Fat area on musculus lattissimus dorsi. 

Fat content (FCON) - Fat content of meat, determined based on laboratory methods. May also be referred 

to as total lipid (TOTLIP). 

Fat content, total - Fat content of the carcass by weight (FATWT), by percentage (FPIC) or by ratio to 
lean (FP). May also be referred to as fat weight (FATWT) or total body fat tissue (FATTIS). 

Fat-cuts percentage (FATCP) - Dissectible fat cut percentage. 
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Fatty acid percentage - Fatty acid composition of a fat sample, as it relates to specific fatty acids or types 
of fatty acids - FA-C14:0, FA-C16:0, FA-C16:1, FA-C17:0, FA-C18:1, FA-C18:1n-7, FA-C18:2, 
FA-C18:3, FA-C20:0, FA-C20:1, FA-C20:2, FA-C20:3, FA-C20:4, FA-C20:5, FA-C22:4, FA-C22:5, 
FA-C22:6, SFA, MUFA, PUFA, etc. 

Feed conversion ratio (FCR) - A measure of an animal’s efficiency in converting feed mass into increased 
body mass. Also called feed conversion rate, feed conversion efficiency (FCE). 

Feed intake (FEEDIN) - Feed consumption, usually over a defined period of time. 

Feed intake per feeding - Feed consumed during each visit to the feeder, usually measured during the 
fattening period. 

Feet and legs score (FLSCORE) - A subjective conformation score of feet and legs, often during 
movement. 

Feet score, front (FSCOREF) - A subjective front feet conformation score. 

Feet score, rear (FSCORER) - A subjective rear feet conformation score. 

Feet weight (FEETWT) - Weight of the feet. 

Fiber type ratio - Ratio of myofibre types, based on myosin isoform ratio, for type 2 (FIB2R) or type 1 
(FIBERI) fibres. 

Firmness (FIRM) - Subjective firmness score; an indicator of meat texture. 

Flavor score (FLAV) - Subjective flavour score, as judged by a human on a taste panel. 

Foreloin weight (LWT) - Weight of the foreloin. 

Gestation length (GEST) - Number of days in gestation. 

Glucose level variations (dGLU) - Changes in blood glucose concentration. 

Glucose-6-phosphate content (GLU6P) - Laboratory measurement of glucose-6-phosphate. 

Half carcass weight (HALFCW) - Weight of a half carcass. 

Ham fat (HFAT) - Fat in ham, expressed as weight (HFATW), thickness (HFT) or percentage (HFATP). 

Ham meat (HMEAT) - Meat in ham, measured by weight or expressed as a percentage. 

Heart weight (HWT) - Weight of the heart. 

Height (HEIGHT) - A general representation of body height measured at the shoulder. 

Hematocrit (HCT) - Proportion of blood volume that is occupied by red blood cells. Also called packed 
cell volume (PCV). 

Hemoglobin (HGB) - Laboratory measurement of blood haemoglobin concentration. 

Humerus length (HUML) - Forelimb bone length measured on dissected carcass. 

IgG levels to K88 antigen (lgG2K88) - See Anti-K88 E. coli IgG level. 

Indole, laboratory (INDO) - Laboratory determination of indole concentration. 

Inside ham weight (IHAM) - Weight of semimembranosus, gracillis and adductor muscles. 

Internal fat ratio (IFR) - Fat deposited around internal organs, expressed as a percentage. 

Intestinal fat - Intestinal fat by weight (IFW), or expressed as a percentage (IFP). 

Intramuscular fat content (IMF) - Amount of fat present within muscle. Usually expressed as a percent¬ 
age of total muscle weight. 

Jowl weight (JOWLWT) - The weight of the cheek meat of a hog. 

Juiciness score (JUICE) - A subjective sensory score, as judged by a human on a taste panel. 

Knuckle ham weight (KHAM) - Weight of vastus intermedius, vastus lateralis, tensor fasciae and vastus 
medialis muscles. 

Lactate level (BLACT) - Laboratory measurement of lactate concentration in the blood. 

Leaf fat - Leaf fat, expressed as weight (LFW) or percentage (LFP). 

Lean mass (LEANWT) - Amount of lean meat, measured by weight. 

Lean thickness (RIBLT) - Meat depth measured between the 3rd and 4th or at the last rib. 

Leg angularity (LANG) - Leg angularity relative to the vertical; on front (LANGF) or hind (LANGH) legs. 

Leg length (LEGLEN) - Length of the legs. 

Leg pastern (LPAST) - Leg scores as part of body conformation assessment; for front (LPASTF) or hind 
(LPASTH) pasterns. 

Leg score (LSCORE) - Leg scores as part of body conformation assessment; for front (LSCOREF) or 
hind (LSCOREH) legs. 

Leg set view (LSETV) - An assessment of leg placement in relation to the body. 

Leg torsion (LTORS) - Degree of twisting or torsion of the legs; on front (LTORSF) or hind (LTORSH) 
legs. 

Length of small intestine (LSI) - Length of the small intestine. This usually reflects the digestive capacity 
of an individual. 
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Lipid accretion rate (LAR) - Rate at which lipids are deposited over time. 

Litter size - Number of offspring in a litter. Also referred to as total number born (TNB). 

Liver weight (LIVWT) - Weight of the liver. 

Locomotion (LOC) - A behavioural trait, recording the number of test arena sections entered during a 
stress test. 

Loin eye area (LEA) - See longissimus dorsi muscle area. 

Loin meat weight (LOINMWT) - Weight of the dissected loin with fat removed. 

Loin muscle depth (LMDEP) - Depth or height of the loin; usually measured by ultrasound, at the last rib. 
Also referred to as eye muscle depth (EMD). 

Loin muscle width (LMWID) - Loin width; usually measured by ultrasound, at the last rib. 

Loin weight (LOINWT) - Total weight of dissected loin. May also be expressed as a percentage of carcass 
weight (LOINP). 

Longissimus dorsi muscle area (LMA) - Cross-sectional area of the longissimus dorsi muscle. Also 
referred to as loin muscle area, loin eye area (LEA), ribeye area (REA). 

Longissimus muscle length (LOINLENGTH) - Length of the longissimus dorsi muscle. 

Lymphocyte number (LYMPH) - Number of a type of white blood cell. Used as a measure of immune 
capacity. 

Marbling (MARB) - Visible intramuscular fat in a cut of meat, which gives it an appearance similar to a 
marble pattern. Degree of marbling is a meat quality trait. 

Mean corpuscular volume (MCV) - A measure of the average red blood cell volume. 

Mean platelet volume (MPVOL) - Average platelet volume, as measured with capillary tubes. 

Meat color-a (MCOLOR-a) - Meat redness, measured by Minolta, Hunter or CIE coordinate. 

Meat color-b (MCOLOR-b) - Meat yellowness, measured by Minolta, Hunter or CIE coordinate. 

Meat color-L (MCOLOR-L) - Meat lightness, measured by Minolta, Hunter or CIE coordinate. 

Meat color score (MCOLOR) - A subjective score for observed meat colour. 

Meat-to-fat ratio (MFR) - Ratio between dissected meat and fat, by weight. 

Melanoma susceptibility (MELAN) - Susceptibility to melanoma, a dark-pigmented, usually malignant 
tumour arising from a melanocyte; occurs most commonly in the skin. 

Melting point (MP) - A characteristic feature of fat, serves as an indication of fat quality in terms of the 
amount of saturated versus unsaturated fat. 

Muscle fiber type percentage (AFIBP) - Percentage of various fibre types in a muscle cross-section; 
includes angular, giant, Type I, Type lla, Type lib muscle fibres. 

Neck weight (NECKWT) - Weight of the dissected neck. 

Nipple number (NN) - See teat number. 

Nonfunctional nipples (NNIP) - Nipples through which milk cannot be drawn. May also be referred to as 
pin nipples or inverted nipples. 

Nose score (NOSESC) - Nose conformation score. 

Number of stillborn (NSB) - Total number of stillborn in a farrowing. 

Off-flavor score (OFFFLAV) - A sensory flavour score used to describe the intensity of any flavour not 
associated with normal pork flavour. Also called subjective abnormal flavor score, in fat (AFLFAT) 
or lean (AFLLEAN). 

Ovary weight (OW) - Weight of the ovary. 

Ovulation rate (OVRATE) - Frequency of ovulation, determined by the number of corpora lutea in the 
ovary. 

Parasite load (PARAS) - Number of parasites present in a sample. Also called parasite burden. 

pH for [tissue] (pH) - Muscle pH, where [tissue] can be biceps femoris, longissimus dorsi, ham, semimem¬ 
branosus, semispinalis capitis, semispinalis dorsi, etc. Measured at different time points, e.g. 45 min, 
24 h or48h post-mortem. 

Plasma FSH concentration (FSH) - Follicle-stimulating hormone (FSH) concentration in blood plasma. 

Platelet count (PLTCT) - Blood platelet counts. 

Pork flavor (PFLFAT) - Subjective score used in a taste panel to describe the intensity of characteristic 
pork flavour. 

Production traits - Characteristics of animals, such as the quantity or quality of the milk, meat, fibre, eggs, 
draught, etc. they (or their progeny) produce, which contribute directly to the value of the animals for 
the farmer, and that are identifiable or measurable at the individual level. Production traits of farm ani¬ 
mals are generally quantitatively inherited, i.e. they are influenced by many genes whose expression in 
a particular animal also reflects environmental influences. 
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Protein content (PC) - Amount of protein in muscle, usually determined by chemical methods; measured 
by weight. 

Purge loss (PURGEL) - See drip loss. 

Relative area of muscle fibers (RAFIB) - Average muscle fibre area, includes Type I, Type lla and Type 
lib muscle fibres. 

Resistance to E. coli K88 infection (K88R) - Ability of a pig to exhibit resistance to K88 E. coli infection. 

Resistance to pseudorabies (PrV) - Ability of a pig to exhibit resistance to pseudorabies virus. 

Rib and vertebral number - Number of ribs and vertebrae. 

Ribeye area - See longissimus dorsi muscle area. 

Rib weight (RIBWT) - Weight of all ribs from a carcass. 

Segmented neutrophil number (SNEUT) - Number of fully mature neutrophils. 

Shear firmness (SHFRM) - A texture measurement usually tested with shear force. 

Shear force (SHEAR) - An instrument test for meat resistance during shearing; an indicator of 
tenderness. 

Shear force at first peak (SF1P) - The shear force measured at the first peak only. 

Spotted coat (CCOLOR) - Presence of different coloured spots on the coat. 

Stress-induced leukocyte proliferation (LEUKPRO) - Increase in leucocyte proliferation when the subject 
is under stress. 

Subcutaneous fat thickness (SUBCFAT) - Thickness of the layer of fat directly beneath the skin. 

Subjective abnormal odor (ABODOR) - Subjective scores describing the intensity of abnormal meat 
odour. 

Subjective overall acceptability score (OVACCS) - A combined taste panel score describing overall 
quality. 

Teat number (TN) - Number of teats on a sow in total (TN), on the left side (LTN) or on the right side 
(RTN). Also referred to as nipple number (NN). 

Tenderness score (TEND) - A subjective assessment of meat tenderness, determined by a human on a 
taste panel. 

Thoracic vertebra number (TVNUM) - Vertebra number in the thoracic section. 

Total number born alive (NBA) - Number of offspring, minus the number of stillborn animals. Also called 

prenatal survival. 

Total number of fibers (TNMF) - Number of myofibres in a section of muscle. 

Total shear work (TSW) - Amount of work required to shear a meat sample; equal to shear force times 
displacement (joules). 

Unsaturated index (FA-UI) - Measurement of the degree of fatty acid unsaturation. 

Unsaturated to saturated fatty acid ratio (USR) - The ratio of unsaturated to saturated fatty acids in a 
sample of fat. 

Uterine capacity (UC) - An indicator of the ability of the uterus to provide the necessary environment to 
maintain fetuses until farrowing. 

Uterine horn size - Can be measured by length (UHL) or by weight (UHW). 

Water holding capacity (WHC) - The ability of meat to retain water during cutting, heating, grinding and 
pressing. A quality trait that affects product processing, appearance and shelf life. Measured by weighing 
filter paper after it has been placed in contact with meat and collected moisture. 

White blood cell counts (WBC) - The number of white blood cells per volume of blood. A high WBC count 
can be an indicator of infection. 
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Appendix III Pig Disease and Defect Glossary 

Pig diseases and defects were primarily obtained from the Online Mendelian Inheritance in 

Animals (OMIA) database (http://omia.angis.org.au/). 

Agenesia of anal sphincter - Congenital absence of anal sphincter. 

Agnathia - Congenital absence of the lower jaw. 

Anal atresia (ANATR) - An abnormality where congenital absence of an opening at the bottom end of the 
intestinal tract is observed. 

Anemia - Abnormally low number of red blood cells (erythrocytes) or abnormally low quantity of haemoglobin. 

Anophthalmos - Congenital absence of one or both eyes, or the presence of rudimentary eyes. 

Aplasia of tongue - Congenital absence of the median portion of the apex of the tongue. 

Artery, anomaly of - Congenital abnormality of an artery. 

Arthritis - Inflammation of a joint. Used interchangeably with arthrosis, which refers to degenerative dis¬ 
ease of joints. 

Arthritis deformans - Erosion of articular cartilage and destruction of sub-chondral bone. Regarded as 
an autoimmune disease. 

Arthrogryposis - Persistent flexion of a joint. 

Asymmetric hindquarter syndrome - Asymmetry of the hindquarters. 

Ataxia, progressive - A progressive failure of muscle coordination, resulting in perverse movements. 

Atherosclerosis - A common form of arteriosclerosis in which deposits of yellowing plaques (atheromas) 
containing cholesterol and other lipid material are formed within the arteries. 

Atresia ani - Congenital absence of the anus, causing a build-up of faeces and consequent distension of 
the abdomen. 

Atresia ilei - Congenital absence of the ileum (the distal portion of the small intestine, extending from the 
jejunum to the caecum). 

Atrophic rhinitis - A chronic inflammation of the nose, characterized by atrophy of nasal mucosa, including 
the glands, turbinate bones and the nerve elements supplying the nose. 

Blood group system - A set of blood types, each corresponding to a particular antigen on the surface of 
red blood cells. Includes blood groups A, B, C, D, E, F, G, H, I, J, K, L, M, N, O, S. 

Brachydactyly - Abnormal shortness of the digits. 

Brachygnathia superior- Congenital abnormal shortness of the maxilla (upper jaw), resulting in protrusion 
of the mandible (lower jaw). 

Cardiomyopathy, dilated - A disorder characterized by cardiac enlargement (especially of the left ventricle), 
poor myocardial contractility and congestive heart failure. 

Cardiomyopathy, hypertrophic - Increase in volume of the muscle tissue of the heart due to an increase 
in the size of muscle cells, primarily in the left ventricle and ventricular septum. 

Cataract - Opacity of the lens of the eye. 

Cephalothoracophagus - A twin ‘monster’ united at the head, neck and thorax. 

Cerebellar anomaly, congenital - Incoordination, ataxia, dizziness and unthriftiness, resulting in death. 
Associated with misshapen nuclei in some of the Purkinje’s cells in the cerebellum. 

Cerebellar hypoplasia - Underdevelopment of the cerebellum. 

Chondrodysplasia - Abnormal growth of cartilage, leading to disproportionate dwarfism. 

Cleft palate - Congenital fissure (split) that involves the hard or soft palate (roof of the mouth). 

Cloaca - Absence of the anus, causing faeces and urine to be voided through the vulva. Named after the 
structure in birds through which faeces and urine are normally voided together. 

Cranioschisis - A cleft in the skull involving one or more of the bones of the cranial cavity. 

Cryptorchidism - Failure of one (unilateral) or both (bilateral) testes to descend. 

Cyclopia - A congenital developmental disorder characterized by a single orbital fossa (eye socket). 
Named after the race of one-eyed giants of Greek mythology. 

Cystic bile ducts and renal tubules - Animals appear normal at birth, but develop distended abdomens 
and show signs of distress between 24 and 48 h. Death follows within 10 days. 

Dicephalus - Developmental disorder resulting in two heads. 

Dwarfism, chondrodystrophy - A skeletal disorder caused by one of the myriad genetic mutations that 
can affect the development of cartilage; leads to small body size. 

Epididymal aplasia - Total or partial failure of development of the epididymis. 
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Epitheliogenesis imperfecta - Congenital absence of the skin. 

Fragile site - A site on a chromosome that does not stain, at which a break in the chromosome often 
occurs. 

Gangliosidosis, GM2 - A lysosomal storage disease in which there is a build-up (storage) of GM2 gan- 
gliosides (a type of glycolipid) in various tissues, due to the lack of the enzyme hexosaminidase, 
whose task is to break down the GM2 ganglioside into its constituents. 

Goiter, familial - Enlargement of the thyroid gland, causing a swelling in the front of the neck - a goitre. 

Harelip - A congenital disorder comprising a cleft in the upper lip, resulting from failure of fusion between 
the maxillary and medial nasal processes. 

Hemophilia A - Impaired coagulability of the blood, with a consequential strong tendency to bleed, result¬ 
ing from a deficiency of clotting factor VIII. 

Hernia - Protrusion of part of an organ or tissue through the structures normally containing it. Depending 
on the locations and degrees, the types of hernia include diaphragmatic hernia (the displacement of 
abdominal organs into the thorax), inguinal hernia (part of the intestine protrudes through the 
abdominal wall; IHERN), scrotal hernia (inguinal hernia that has passed into the scrotum), umbilical 
hernia (protrusion of part of the intestine or other abdominal organs through the abdominal wall at the 
umbilicus), etc. 

Heterochromia iridis - Difference in colour of the iris in the two eyes, or in different areas of one iris. 

Hind limb paralysis - Paralysis of the hindlimb. 

Hip dysplasia - Laxity of the hip joint, resulting from a shallow acetabulum and/or a small, misshapen 
head of the femur. In severe cases, affected animals are lame. 

Histiocytosis, malignant - Also known as histiocytic sarcoma (HS). 

Holoprosencephaly - Developmental failure of cleavage of the forebrain (prosencephalon), with a deficit 
in midline facial development and with cyclopia in the severe form. 

Hydrocephalus - Enlargement of the cranium caused by accumulation of fluid. 

Hydrops fetalis - Accumulation of fluid in the whole body of the newborn. 

Hypercholesterolemia - The presence of high levels of cholesterol in the blood. 

Hyperlipidemia - An elevation of lipids (fats) in the bloodstream. 

Hyperostosis - An excessive growth of bone. 

Hypocatalasia - A genetic disorder caused by deficiency of catalase enzyme. 

Hypomyelinogenesis, congenital - Inadequate synthesis of myelin; includes failure of formation of myelin, 
plus incomplete and delayed myelination of axons. 

Hypotrichosis, dominant - Dominant form of hair loss. 

Hypotrichosis, juvenile with age-dependent emphysema - Juvenile hairlessness and age-dependent 
emphysema. 

Hypotrichosis, recessive - Recessive form of hair loss. 

Infertility - Inability to conceive. Has a wide variety of causes, some of which are at least partly genetic. 

Intersex - A mixture of male and female characteristics, often due to an abnormality of the sex 
chromosomes. 

Joint abnormalities (JOINTAB) - An assessment of abnormality in the joints, usually scored. 

Joint lesions (JOINTL) - Presence of lesions on the joint. 

Kartagener syndrome - Congenital defect in functioning of the cilia. 

Knobbed acrosome -The presence of a protrusion (knob) on the acrosome (cap) of a sperm. Associated 
with sterility. 

Leg weakness - Evidence of mis-formed legs (front or rear). May be a congenital defect of the limbs that 
prevents standing. 

Lymphosarcoma - Malignant neoplastic disorder of lymphoid tissue. 

Macrophthalmia - Abnormal enlargement of the eyeball. 

Malignant hyperthermia - A progressive increase in body temperature, muscle rigidity and metabolic 
acidosis, leading to rapid death. 

Mannosidosis, alpha - A lysosomal storage disease in which there is a build-up of mannose-rich 
compounds. 

Megacolon - Also known as Hirschsprung disease. A disorder in which the large intestine undergoes a 
large dilation and fills with faecal mass. 

Membranoproliferative glomerulonephritis type II - A progressive inflammation of the capillary loops in 
the glomeruli of the kidney. 

Microphthalmia - Abnormal smallness in all dimensions of one or both eyes. 
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Miniature - A type of pig that is small in body size; usually less than 100 kg at maturity. 

Muscular hypertrophy - Abnormal increase in muscular tissue caused entirely by enlargement of existing 
cells. 

Neonatal diarrhea - Diarrhoea in newborn piglets, caused by E. coli K88 or K99. 

Nephropathy - Disease of the kidneys. 

Nipples, asymmetrical numbers - Asymmetrical numbers of nipples. 

Nucleoside transport defect - Defect in the transport of nucleosides (purine or pyrimidine base attached 
to a ribose or deoxyribose sugar) across erythrocyte membranes. 

Osteochondrosis - Abnormal differentiation of growth cartilage. 

Otitis interna, susceptibility to - Susceptibility to inflammation of the inner ear, i.e. the vestibule, cochlea 
and semicircular canals. 

Otitis media, susceptibility to - Susceptibility to inflammation of the middle ear. 

Persistent frenulum preputii - A close attachment of the penis to the prepuce, resulting in inadequate 
protrusion of the penis. 

Polydactyly - Having more than the normal number of digits. 

Porcine stress syndrome (PSS) - A syndrome, commonly initiated by extreme physical stress such as 
fighting, marked by difficult breathing, increased rate of respiration, blanching and reddening of the 
skin, and followed by cyanosis and acidosis. The symptoms include total collapse, muscle rigidity and 
extreme hyperthermia, and generally result in death. It is caused by a single autosomal recessive 
gene, called the ryanodine receptor gene. 

Porphyria, congenital erythropoietic - A rare inborn error of porphyrin-haem synthesis inherited as an 
autosomal recessive trait. 

Porphyria cutanea tarda - The most common subtype of porphyria. The disorder results from low levels 
of the enzyme responsible for the fifth step in haem production. 

PSE - Refers to pork that is pale, soft and exudative (watery surface) and may result from porcine stress 
syndrome. 

Renal hypoplasia - An abnormally small kidney. May be bilateral or unilateral. 

Respiratory distress syndrome - A disorder characterized by breathing problems in newborn pigs. 

Retinitis pigmentosa - A group of genetic eye disorders. 

Rhabdomyolysis - Disintegration of muscle fibres, with consequent excretion of myoglobin in the urine. 

Rhinitis (RHIN) - Inflammation of the nasal mucosa. 

Syndactyly - Fusion of the claws (cleats or digits). Occurs in one, two, three or four legs. 

Testicular feminization - An abnormality of sexual development in which affected individuals have an XY 
chromosomal constitution, undescended testes and female secondary sexual characteristics (includ¬ 
ing female external genitalia). 

Thrombocytopenic purpura, autoimmune - Characterized by a low platelet count, normal bone marrow 
and the absence of other causes of thrombocytopenia. 

Thrombopathia - A disorder of blood coagulation resulting from a failure of ADP release from platelets on 
stimulation by aggregating factors such as thromboplastin. 

Twinning, conjoined - Congenital disorder in which twins are partly joined. 

Urolithiasis - Formation of calculi (‘stones’) in the urinary tract, or the condition associated with the pres¬ 
ence of such calculi. 

Ventricular septal defect - A congenital heart defect characterized by persistent patency of the ventricu¬ 
lar septum, permitting flow of blood directly between ventricles, bypassing the pulmonary circulation 
and resulting in various degrees of cyanosis. 

Vitamin D-deficiency rickets, type I - Inherited deficiency of the enzyme 1-hyroxylase.The most notice¬ 
able effects include a failure of calcification of bones (leading to bowing of limbs) and delayed 
dentition. 

Von Willebrand disease - The most common hereditary coagulation abnormality. 

Wilms tumor - A rapidly developing malignant mixed tumour of the kidneys. 

Wilson disease - A disorder of copper metabolism due to a deficiency of ceruloplasmin, which forms a 
complex with copper. The excess copper is deposited in the brain, causing mental retardation, or the 
liver, causing jaundice and cirrhosis. 
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activation-induced cytidine deaminase (AID) 117 
adeno-associated virus (AAV) vectors 250 
ADG see average daily gain 
adrenocortical function 208, 210-211 
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aggression 211-213 
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allelic diversity 315 
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pigs as 246, 253-254, 429-432, 433-434, 
436-437 
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aplasia of the tongue 54 
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genetic resources 307 , 311 
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ataxia, progressive 54 
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ATP1A1/ATP1B1 genes (ATPase) 85 
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330-335 
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heterosis and 347 
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B cell receptors (BCRs) see immunoglobulins 
Ba Xuyen breed 452 

Babyrousa (babirusa) spp. 3, 4-5, 148, 150 
back test 210 
backcrossing 409 
backfat thickness 

correlation with reproduction traits 343, 
344-345 

feed intake and 328 
heritability/correlations with other 

performance traits 330, 331, 332, 333, 
334, 335 

as measure of carcass composition 326, 357, 

358, 360 

QTLs 366-369 

selection experiments 335-336, 336-337 
variations in terminology 477 
background selection 409 

BACs (bacterial artificial chromosomes) 154-155, 

156, 185 

contig maps 155-156, 185-186, Plate 3 
in genome sequencing 187-188 
Bantu breed 454 
basal cell carcinoma 435 
Bazna breed 450 

BCRs (B cell receptors) see immunoglobulins 
bearded pig (Sus barbatus) 8, 149 
behaviour genetics 200-213 

aggression 205-206, 207, 211-213 
breeding programmes 202, 405 
emotional behaviours 208-211 
feeding and drinking 202-204 
maternal behaviours 205-208 
pig well-being and 201-202 

QTLs 201-202, 203-204, 206, 211 
sexual behaviours 204-205 
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Belt (I Be ) allele 41, 46 
Bentheim Black Pied breed 450, Plate 18 
Berkshire breed 44, 376, 449, 460, Plate 13 
best linear unbiased prediction 

(BLUP) 400, 402 

biodiversity see genetic diversity 
biomedical sciences 426-437 

animal models of human disease 427-429, 
432-433 
cancer 434-436 

pigs as 246, 253-254, 429-432, 
433-434, 436-437 
retinitis pigmentosa 250 
production of heterologous 
proteins 245-246 
research uses 146, 432, 436-437 
xenotransplantation 78-79, 253, 431 
birth weight 225, 226 
black coat colour 42-45 


Black Slavonian breed 450 

black spotting (E* 7 ) allele 42-44 

blastocysts 268, 269, 276-277 

blood groups 54-55 

blood parameters, QTLs 122 

BLUP see best linear unbiased prediction 

boar taint 359, 372-373, 405 

boars 

performance testing stations 400 
reproduction traits 222-223, 223, 224 
correlation with growth rates 227 
in crossbred animals 221 

QTLs 228, 229 

reproductive biology 205,219-220, 
265-266 

sire lines 394, 397, 400-402 
Brachyury (T) gene 278, 279, 285 
breed organizations 

breed societies 394-395, 396, 446 
breeding companies 395, 396, 459-460 
breeding 390-417 

behavioural traits 202, 405 
breeding objectives 326, 392, 

397-399, 404 

breeding pyramid 326, 392, 393-394, 
404-405 

environmental influences 326-327, 404 
GS 414-416 
MAS 407-414 

phenotypic selection, limitations of 406-407 
principles of genetic improvement 391-392 
programme designs 400-406 
selection indexes 325-326, 392, 398-400, 
410-411 

see also breed organizations; breeds; 
crossbreeding 
breeds 445-461 

allelic diversity 315 

coat colour as a marker 46-47 

conservation 309-311, 315, 446-447 

definition of 445-446 

degree of endangerment 308-309, 315 

descriptions of 447, 448-454, Plates 4-27 

distinctiveness (D,) 315 

genetic diversity 312-314 

history of 316, 445 

identification and traceability 47, 316-317, 
373-374, 414 

inventories 307-308, 394-395, 446 
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traits 374-376 

reproduction traits 220-221 

research information sources 447, 455-459 
synthetic (composite) 394, 409-410, 
459-460 

taxonomic (phylogenetic) trees 316, 317, 
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see also breed organizations; breeding; 
crossbreeding 
British Lop breed 449 
Brown (B) locus 39, 46 
BTG gene family (B-cell translocation) 289 
Bulgarian White breed 450 
bush pig ( Potamochoerus larvatus) 5-6, 148 

C-value paradox 74 

CAGE (cap analysis gene expression) 184 

calcium transient 266 

calpastatin 364 

Canada 393, 395 

cancer 

lymphosarcoma in the pig 58 
pig as a model for human disease 434-436 
cannibalism 205-206 
Cantonese breed 452 
carcass length 358, 360, 369 
carcass traits 357 

breed differences 374-376 

correlation between carcass traits 359, 360 

correlation with meat quality traits 360, 362 

correlation with reproduction traits 227 

crossbreeding and 376 

genes affecting 364-365, 413 

heritability of 357, 358, 359 

QTLs 365-370 

traceability 47, 373-374, 414 

see also backfat thickness 

cardiovascular disease, pig as a model for 430, 434 

casein gene family 86 

CAST gene (calpastatin) 364 

castration 405 

cathepsins 365 

cattle-pig zoo-FISH mapping 159 

Qayonu Tepesi (Anatolia) 20 

CCKAR gene (cholecystokinin-A receptor) 341 

cDNA probes 181-182, 184 

CDR3 (complementarity determining region) 

116, 117 

Cdx2 gene 277, 280 

cell polarity 274, 277, 279-280 

centric fusion (Robertsonian) translocations 

144, 147 

centromeres 76-77, 137 
Cetartiodactyla 1-2 

CGH see comparative genetic hybridization 
chemokines 283 

Chester White breed 448, Plate 4 
chimerism 145 
Chinese pigs 

breeds 452-453, 457-458, Plates 22-27 
coat colour 47 
domestication 19, 23-27 
see also Meishan breed 


cholecystokinin-8 (CCK) 338-339, 340, 341 
cholesterolaemia 57 
chorion 281 

chromosome maps 150-160, 185-186, 187 
immunoglobulins 111 , 112 
MHC classes I and II 106, 107-108 
TCR genes 120 
chromosomes 

abnormalities 139-147, 192-193, 293 
breeding out 406 
banding studies 135-137 
cloned pigs 147 
comparative studies 

human-pig 158-159, 189-190, Plate 2 
Sus scrofa subspecies 7 
Suidae 147-150 

FISH analysis 137-139, 157-159 

karyotypes 

domestic pig 136, 146 
translocation in a feral pig 150 

NORs 135-137 
telomeres 83-84, 137, 147 
clone libraries 154-155, 156 
cloning 

biomedical research 432 

cytogenetics 147 

as method of reproduction 245, 

247-248, 397 
tissue samples 183 
cMos knockout mice 266 
CNVs see copy number variants 
coagulation disorders 64, 67 
coat colour 32, 38-47, 39, 374, Plate 1 
colostrum 219, 226-227 
colour of meat 357, 370, 375 
commercial organization of the pig industry 

392-395, 396, 459-460 
comparative chromosome maps 158-159, 
189-190, Plate 2 

comparative genetic hybridization (CGH) 139 
composite (synthetic) breeds 394, 409-410, 
459-460 

conception rates 219, 221, 227 
congenital tremor syndromes 65 
conservation of genetic diversity 309-311, 315, 
446-447 

conserved synteny 158-159, 189-190, Plate 2 
contig maps 155-156, 185-186, Plate 3 
cooking loss 360, 361 

copy number variants (CNVs) 87, 139, 192-193 
corpus luteum 243, 282 
cortisol 211 

CRC locus (calcium release channel) 87 
Cre recombinase 251 
crossbreeding 

breeding programmes 392, 403-404, 405 
marker-assisted selection and 408-410, 411 
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crossbreeding (continued) 

meat quality and 376-377 
performance traits and 346-347 
reproduction traits and 221-222 
cryopreservation of gametes and embryos 244, 
310-311, 315 
cysts, renal 63 

cytochrome P450 gene family 66 , 372, 434 
cytogenetics see chromosomes 
cytokines 282-283 
cytoskeleton, in the embryo 277 
Czech Improved White breed 450 


Da Min breed 452, Plate 22 
dam lines 39, 394, 397, 402-403 
DAX1 gene 293 
DDX4 gene 290 
deepCAGE 184 

degree of endangerment (DE) 308-309, 315 
demethylation of DNA 251, 266, 275 
Denmark 393, 395 
Dermantsi Pied breed 450 
dermatosis vegetans 55 
developmental genetics 263-294 
embryogenesis 266-271 
gametogenesis 219, 264-266 
genetic control 

implantation and placental 
development 281-285 
post-implantation 285-289 
pre-implantation 272-281 
sex differentiation 64, 145, 289-293 
diarrhoea, neonatal 61 
Dilution (D) locus 39, 46 
disease resistance 109-110, 122, 253 
distinctiveness of breeds (D) 315 
diversity see genetic diversity 
DNA 

cDNA probes 181-182, 184 
GC content 75-76 , 91 
methylation/demethylation 91, 251, 266, 
275, 276 

mitochondrial 92-93 

repetitive sequences 75-76, 186-187, 188 
centromeric repeats 76-77, 137 
CNVs 87, 139, 192-193 
expressed sequences 84-87 
microsatellites 82-83, 191, 317, 319 
minisatellites 81-82 
telomeric repeats 83-84, 137, 147 
transposons 77-81 
single-copy 87 
DNA-combing 157 
domestication 15-16 

of Sus scrofa 14, 16-33, 445 


Dominant black ( E D ) allele 42, 43 

Dominant white (I) locus 39-42, 41, 46 

DRB locus (MHC class II) 86 

dressing percentage 357, 358, 360, 369-370 

drinking behaviour 204 

drip loss 360, 361, 363 

Duroc breed 394, 448, Plate 5 

production traits 336, 375, 376 

reproduction traits 221 
dwarfism 56 


eating quality traits 357, 358, 360, 361, 364, 
371-372 

EBVs see estimated breeding values 
economic considerations 

economic weights of breeding traits 326, 
398-399, 403 

marker-assisted selection 411-412 
ectoderm, embryonic development 285 
EDNRB gene (endothelin receptor beta) 47 
effective population size (NJ 308 
Egypt 31 
Eif41b gene 273 
electroporation 248 

embryonic genome activation (EGA) 273, 

274-275 

embryonic stem (ES) cells 248, 252 
embryos 

cryopreservation 244 
cytogenetic analysis 147 
death of 219, 270-271 
embryogenesis 266-271 
genetic control of development 

implantation and placentation 281-285 
post-implantation 285-289 
pre-implantation 272-281 
sex differentiation 64, 145, 289-293 
sexing of 76-77 

transfer (ET) 244-245, 396-397 

emotional behaviours 208-211 
endoderm, embryonic development 278, 279, 285 
environmental influences on breeding 
outcomes 326-327, 404 
Eomes gene 277, 280 
epigenetics 76, 91 

in early embryos 275-276 
in gametogenesis 266 
imprinting 276, 412, 413 
in SCNT 251-252 
epitheliogenesis imperfecta 56 
Escherichia coli F4 (formerly K88) 61 
ESR (estrogen receptor) gene 231, 413 
estimated breeding values (EBVs) 400, 402, 

408, 410 

genomic (G-EBV) 415, 416 
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mothering ability 205 
pig survival 208 

ESTs see expressed sequence tags 
ET see embryos, transfer 
ethical concerns 201, 398-399, 405 
Eurasian wild boar (Sus scrofa) 6-7, 9, 10, 

148-149 

domestication 14-33 
European pigs 

breeds 449, 450-451, 455-456, 

Plates 13-21 
coat colour 44, 47 
domestication 17-18, 19, 20, 21-23 
genetic resources 307, 311, 316 
see also individual breeds 
evolution 2, 3, 6, 7, 10, 316 

molecular genetics 80-81, 159 
evolutionary break-point regions (EBRs) 143 
expressed sequence tags (ESTs) 180 
expression profiling 180-184 
Extension (E) locus 39, 42-45, 43 
extinction risk 308-309, 315 
extra-embryonic membranes 281-282 
extrapolation (calculation of allelic 
richness) 314-315 
eye development 288 

farrowing 219, 226 

fat content of meat 357, 358, 359, 360, 361, 365 
fatty acid composition 372 
marbling 371, 375 
QTLs 366-369 

see also backfat thickness 
feed conversion rate (FCR) 331, 334, 334, 335, 
343 

feed intake (FI) 331, 333, 334, 335 

physiology of high versus low FI lines 339 
relationship with lean tissue gain 327-330 
selection experiments 335, 336, 337 
see also residual feed intake 
feeding behaviours 202-204 
females see sows 
Fengjing breed 452, Plate 23 
fertilization 219, 266 
FI see feed intake (FI) 
fibre-FISH 157 
fibroblasts 

in vitro transformation 435, 436 
SCNT 251 

fingerprinted contig (FPC) maps 155-156, 186 
FISH see fluorescence in situ hybridization (FISH) 
flow sorting of chromosomes 137-138, 158 
fluorescence immunocytochemistry 139 
fluorescence in situ hybridization (FISH) 137-139, 
156-159 


folliculogenesis 264, 265, 290 
foreground selection 409 

forest hog ( Hylochoerus meinertzhageni) 5, 148 
fragile sites (FS) 144-145 
freezing of gametes and embryos 244, 310-311, 
315 


G-banding 136 

G-EBV (genomic estimated breeding values) 415, 

416 

G-value paradox 74-75 
gametes 

cryopreservation 244, 310-311, 315 
gametogenesis 219, 264-266 
see also oocytes; sperm 
gametic imprinting 276 
gangliosidosis 56 

gastrointestinal disease, pig as a model for 430, 
434 

gastrulation 269, 278-279 

GC content of DNA 75-76, 91 

gene banks 310-311,315 

Gene Expression Omnibus (GEO), microarrays 

181-182 

gene hitchhiking 318 

gene knockoutAnock-in (gene targeting) 248-251 
gene mapping 150-160, 185-186, 187 
immunoglobulins 111 , 112 
MHC classes I and II 106, 107-108 
TCR genes 120 

gene nomenclature 473-475, 480 

gene silencing 292 

genetic diversity 306-320, 392 

breed inventories 307-308, 394-395, 446 
conservation 309-311, 315, 446-447 
extinction risk 308-309, 315 
genomic variation 190-193 
markers 311-317 
marker-trait associations 318-320 
genetic engineering see cloning; transgenics 
genetic evaluation 391-392, 400, 402 
genetic improvement 390-417 
behavioural traits 202, 405 
breeding objectives 326, 392, 397-399, 404 
breeding programme designs 400-406 
breeding pyramid 326, 392, 393-394, 
404-405 
GS 414-416 
MAS 407-414 

phenotypic selection, limitations of 406-407 
principles of 391-392 
role of assisted reproduction 242-245, 
396-397 

role of breeding organizations 394-395, 

396, 459-460 
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genetic improvement ( continued) 

selection indexes 325-326, 392, 398-400, 
410-411 

see also crossbreeding; domestication 
genetic lag 392, 394, 397 
genomic imprinting 276, 412, 413 
genomic libraries 154-155, 156 
genomics 179-193 

in biomedical research 436-437 
comparative 158-159, 189-190, Plate 2 
databases 180, 181, 186, 188, 192, 228, 
480 

eQTLs 373 
ESTs 180 

expression profiling 180-184 
genomic postulates 427 
GS 414-416 

ncRNA 74, 89-90, 184-185 
reproduction traits 232 
sequencing 155-156, 179, 186-189, 193 
variation within the species 190-193 
see also chromosome maps; molecular 
genetics 

genotype nomenclature 475 

gilts, reproductive biology 219, 227, 243 

glomerulonephritis 60 

Gloucestershire Old Spot breed 449, Plate 14 
glycogen metabolism 357, 363 
gonadotrophins 243 
Goosecoid (GSC ) gene 278 

GPI pseudogene (glucosephosphate isomerase) 81 
growth factors see insulin-like growth factors; 

vascular endothelial growth factor 
growth hormone 87-88, 91 
growth rates 

correlation with reproduction traits 227, 

342-346 

covariation of performance traits 330-335 
heterosis and 347 

model of relationships between feed intake 
and tissue gain 327-330 
physiology and molecular biology of 338-342 
selection experiments 335-338, 341, 
344-345 

transgenic pigs 252-253 
GS (genomic selection) 414-416 
Guinea breed 454 

H-FARP gene (heart fatty-acid binding protein) 371 
hair, curly 67 
hairlessness 57-58 

HAL locus (halothane sensitivity) ( RYR1 gene) 59, 
209-210, 232, 361-363, 369, 374 

ham, variation in processing yield see RN locus 
Hampshire breed 363, 375-376, 448, Plate 6 


Hausdorff distance 312 

heart, embryonic development 288 

Hereford breed 448, Plate 7 

heterochromia iridis 56 

heterosis 221-222, 346-347, 376-377 

heterosynapsis, early 145-146 

Heude’s pig (Sus bucculentus) 4, 150 

Hezuo breed 452 

hind limb paralysis 57 

histones 76, 252, 275-276 

HOX (homeobox) genes 286, 287 

human approach test 210 

human-pig comparative maps 158-159, 

189-190, Plate 2 

hybrid vigour (heterosis) 221-222, 346-347, 
376-377 

hybridization, between Sus species 9-10, 28, 30 
Hylochoerus meinertzhageni (giant forest hog) 

5, 148 

hyperacute rejection 253, 431 
hypercholesterolaemia, spontaneous 57 
hypothalamic-pituitary-adrenal (HPA) axis 208, 
210-211 

hypotrichosis 57-58 

Iberian pigs 47, 374, 450 

ICSI see intracytoplasmic sperm injection 

IGFs see insulin-like growth factors 

IGH locus (heavy chain immunoglobulins) 86, 

111-114, 115, 116-117 

IGL/IGK loci (light chain immunoglobulins) 88-89, 
115-116 

Illumina 60K Beadchip 193 
immotile short-tail sperm (ISTS) defect 64 
immunization 110 
immunogenetics 101-124 

BCRs see immunoglobulins 
innate immunity 103-105, 121 
MHC (SLA) 86, 102, 105-110, 186 
quantitative trait loci (QTLs) 109, 121-124 
TCRs 89, 102, 105, 117-121 
transplant rejection 253, 431 
immunoglobulins 89, 102 

heavy chains (IGH genes) 86, 111-114, 115, 
116-117 

light chains ( IGL and IGK genes) 88-89, 
115-116 

response associated with SLA genes 110 
somatic recombination 88-89, 105, 115-117 
subtypes 111 
IMNpRH2 panel 154 
implantation 281-285 
IMpRH panel 153-154 
imprinting 276, 412, 413 
inbreeding 308, 310, 392, 402, 407 
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index selection 325-326, 392, 398-400, 
410-411 
India 9, 30-31 
Indochina 19, 25, 27-29 
Indonesia 5, 8, 10, 19, 29-30 
infanticide 205-206, 207 
inherited disorders 51-53, 54-67 
inner cell mass (ICM) 269, 277 
INRA (Institut National de la Recherche 
Agronomique) 140, 457 
insulin-like growth factors (IGFs) 

performance traits and 338, 339, 339-340 
IGF2 in carcass composition 
364-365, 413 

in pregnancy and the fetus 283-284, 289 
interchromosomal effects (ICEs) 146 
interferon 282 

interferon-stimulated genes (ISGs) 282-283 
Internet resources 

breeds 307, 447 

gene nomenclature 474, 475 

genome databases 180, 181, 186, 

188, 192, 228, 480 

inherited traits and disorders 52-53, 476 
ISGs 282 

pig models of human disease 436 
SLA polymorphisms 108 
trait nomenclature 476, 478 
intersexuality 145, 293 
intracytoplasmic sperm injection (ICSI) 247 
introgression 409, 413 
inventories of genetic resources 307-308, 
394-395, 446 

inversions (chromosomal) 144 

ISEA (Island South-east Asia) 5, 8, 10, 19, 29-30 

isochores 75-76, 91 

Italy 19, 22, 23 

Javan warty pig (Sus verrucosus) 8, 149 
Jinhua breed 452 

kallikreins 86 
karyotype 

domestic pig 136, 146 
translocation in a feral pig 150 
Kele breed 452 
keratinocytes 436 
kidney disorders 60, 63 
kinetochores 76 

KIR gene (killer immunoglobulin receptor) 102 
KIT locus (tyrosine kinase receptor) 40-42, 41, 
374 

Krskopolje breed 450 
Kunekune breed 454 


Lacombe breed 448 
Laconie breed 450 
lactation 219, 226-227 
transgenic sows 252 
Landrace breed 394, 450, Plate 19 
feeding behaviour 203 
production traits 333, 375 
reproduction traits 220-221, 221 
Lanyu (Taiwan) 26 
Large Black breed 449 
Large Black-White breed 452 
Large White breed 394, 449 

crossbred with Meishan 222 
production traits 333, 336, 376 
reproduction traits 220, 221 
see also Yorkshire breed 
LD see linkage disequilibrium markers 
LE see linkage equilibrium markers 
lean tissue, relationship to feed intake 327-330 
lean tissue feed conversion (LTFC) 329, 331, 333, 
348 

correlation with reproduction traits 345, 346 
selection experiments 337-338, 341, 345 
lean tissue growth rate (LTGR) 329, 331, 333 
correlation with reproduction traits 345, 346 
physiology of high versus low LTGR lines 339 
selection experiments 336, 337-338, 341, 345 
lean-to-fat ratio 360, 362, 448-454 
left-right asymmetry 280 
legless 58 

LEPR gene (leptin receptor) 229, 369 
leptin 339, 340 

Lewontin-Krakauer (LK) test 318 
libido, male 205 

LILR gene family (leucocyte immunoglobulin-like 
receptor) 102 

limb abnormalities 54, 57, 58 

limb morphogenesis 288 

LINEs see long interspersed nuclear elements 

linkage disequilibrium (LD) markers 320, 407-408 

linkage equilibrium (LE) markers 407, 408 

linkage maps see chromosome maps 

Lithuanian Native breed 451 

litter size 218 

breed differences 220 
breeding programmes 402, 403 
chromosomal abnormalities and 140, 144 
correlation with other traits 227, 342-343, 
344-345, 403 

genetic effects 222, 223-224, 225-226 
QTLs 229-230 

litter survival see piglet survival 

locus nomenclature 474-475 

loin muscle dimensions 326, 357, 360, 370 

long interspersed nuclear elements (LINEs) 

78, 79, 187 
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long terminal repeat (LTR) elements 76, 186 
LTFC see lean tissue feed conversion 
LTGR see lean tissue growth rate 
LTR see long terminal repeat elements 
Luchuan breed 452, Plate 24 
lymphosarcoma 58 

males see boars 

malignant hyperthermia 59, 361-362 
see also RYR1 gene 
Mangalitsa breed 45, 46, 451, Plate 20 
MAP kinase 266, 289 
marbling 371, 375 

marker-assisted selection (MAS) 407-414 
maternal behaviours 205-208 
maternal imprinting 276, 412, 413 
MC1R gene (melanocortin-1 receptor) 32, 

42-45, 374 

MC4R gene (melanocortin-4 receptor) 341, 

365, 366 

meat quality 355-377 

breed differences 374-376 

coat colour and 39 

crossbreeding 376-377 

genes affecting 59, 360-364, 365, 413 

marker-assisted selection 411 

QTLs 365-366, 367-368, 370-373 

reproduction traits and 228 
traits associated with 356-357 

correlation with carcass composition 
traits 360, 362 

correlations between 359-360, 361 
heritability 357, 358, 359 

medical uses see biomedical sciences 

meiosis 139, 264, 265-266, 266 
meiotic linkage maps 151-152 
Meishan breed 452, Plate 25 

crossbred with Large White 222 
production traits 375, 376 
reproduction traits 220 
melanocortin receptor genes 

MC1R 32, 42-45, 374 
MC4R 341, 365, 366 

membranoproliferative glomerulonephritis 
type II 60 

mesoderm, embryonic development 278, 279, 285 
methylation 

of DNA 91, 251, 266, 275, 276 
of histones 275-276 
MHC (major histocompatibility complex) 

86, 102, 105-110, 186 

microarrays 180, 181-182 
microdissection 138-139, 157-158 
microRNA (miRNA) 74, 86, 90-91, 184, 251 
microsatellites (MS) 82-83, 191, 317, 319 
Middle White breed 449, Plate 15 


milk production 219, 227 
transgenic sows 252 
minisatellites 81-82 
Minzhu breed 452 
miRNA see microRNA 
mitochondrial genome 92-93, 192 
markers of domestication 18, 21, 

25, 26, 30-31, 31-32 
MLEs (Mariner- like elements) 77 
molar tooth morphology, domestication 

and 27, 30 

molecular genetics 73-93 

behavioural traits 201-202, 203-204, 

206, 211 
breeds 460 
coat colour 38-47 
developmental 266-267, 272-294 
gene structure and function 87-91 
genetic diversity markers 311-320 
genome size 74-75, 91-92, 188 
identification and traceability 47, 316-317, 
373-374, 414 

immune system 86, 88-89, 102-124 
marker-assisted selection 407-417 
meat quality/carcass composition 

360-373, 377 

mitochondrial genome 92-93 
ncRNA 74, 89-91, 184-185 
performance traits 340-341 
repetitive DNA 75-76, 186-187, 188 
centromeric repeats 76-77, 137 
CNVs 87, 139, 192-193 
expressed repetitive sequences 84-87 
microsatellites 82-83, 191, 317, 319 
minisatellites 81-82 
telomeric repeats 83-84, 137, 147 
transposons 77-81 
reproduction traits 228-232 
single-copy DNA 87 
see also genomics 
Mong Cai breed 452, Plate 26 
Mora Romagnola breed 451 
morphology 51, 52 

Babyrousa spp. 4-5 
Hylochoerus meinertzhageni 5 
Phacochoerus spp. 5 
Potamochoerus spp. 5-6 
Sus spp. 8, 9 
motor neuron disease 60 
Moura breed 448 
mRNA (messenger RNA) 88 
oocyte 267, 272-274 
RNAseq 183-184 

MS see microsatellites 

MTTP gene (microsomal triglyceride transfer 
protein) 365 
Mukota breed 454 
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Mulefoot breed 448 
muscles 

conversion to meat 356 
see also RN locus 
growth and relationship with feed 

intake 327-330 
myogenesis 288-289 
progressive myopathy 62 


Napole yield 357, 361 
see also RN locus 

national breeding programmes 393, 395 
ncRNA (non-coding RNA) 74, 89-90, 184-185 
Near East 19, 20-21, 23 
Neijiang breed 452 

Neolithic Age, pig domestication 16-33 
neonatal diarrhoea 61 

neuroendocrine responses to stress 208, 210-211 

neuropeptide Y (NPY) 339 

neurulation 269, 279 

neutrality of markers, tests for 318 

New Guinea 8, 20, 29, 30 

next-generation sequence technology 183-184 

Ningxiang breed 453 

NOD genes (NLR (nucleotide-binding domain, LRR- 
containing) proteins) 104-105 
Nodal signalling 279, 285 
nomenclature 473-480 

non-economic values of breeding traits 201, 399 

NORs see nucleolus organizer regions 

notochord 279 

NPY see neuropeptide Y 

NROB1 (DAX1) gene 293 

nuclear mitochondrial sequences (numts) 93 

nucleolus organizer regions (NORs) 85, 135-137 

nucleoside transport defect 61 

nucleus herds 326, 393-394 

nursing behaviour 219 


obesity, pig as a model for 431, 432 
OBO-Edit ontology editor 478, 479 
Oceania 

breeds 454 

pig domestication 28, 29, 30 
OCT4 gene 278, 280 
oedema (myxoedema) 61 
oedema disease (gut oedema) 63 
oestrogens, produced by the conceptus 282, 283 
oestrus 205, 219, 225 

synchronization of 243, 244 
oncogenes 435, 436 

proto-oncogenes 277 
Online Mendelian Inheritance in Animals 
(OMIA) 52-53, 476 


Online Mendelian Inheritance in Man (OMIM) 52 

ontologies, for traits 478-479 

oocytes 

cytogenetic analysis 146-147 
imprinting 276, 412, 413 
mRNA and control of embryonic 
development 267, 272-274 
oogenesis 219, 264-265, 266 
transduction 247 
organogenesis 285, 286-288 
Ossabaw Island breed 448 
ovary, embryonic development 289-290, 292 
ovulation 219, 264 

rate 225, 228-229, 230, 252 
synchronization of 244 
Oxford Sandy and Black breed 449 

Pacific Clade 25, 28-30 

PAGs see pregnancy-associated glycoproteins 

painting probes 138 

palaeogenetics 17-20, 21-22, 23, 24-26, 28-33 

Pampa Rocha breed 448, Plate 11 

Patch (I p ) allele 40 

paternal imprinting 276 

Pax genes 288 

peccaries 2 

pedigree analysis 309-311,413 
pedigree records 394-395 
performance traits 325-348 
by breed 448-454 
correlations with reproduction traits 

342-346, 403 
crossbreeding and 346-347 
genetic variation/covariation 330-335 
heritability estimates 331-332 
physiology and molecular biology 

338-341, 342 

selection experiments 335-338, 341, 
344-345 

selection methods 325-330, 399-400 
see also individual traits 
PERVs see porcine endogenous retroviruses 

PG 600 243 

pH of meat 370-371 

as a quality indicator 357, 358, 360 
Phacochoerus spp. (warthogs) 3, 5, 148 
pharmaceutical research 433, 434 
phenes 54-67 
phenotype nomenclature 480 
Philippine warty pig (Sus philippensis) 8, 150 
phylogeny 1-2, 3-4, 7 
physiology, pigs as biomedical models of 430 
PI genes (protease inhibitors) 86 
Piau breed 448, Plate 12 
Pietrain breed 394, 451, Plate 21 
meat quality traits 375, 376 
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Pietrain breed (continued) 
reproduction traits 221 
research information sources 459 
piglet birth weight 225, 226 
piglet survival 

behavioural factors affecting 205, 207, 208, 
219 

breed differences 220 
correlation with performance traits 343 
economic value in breeding 
programmes 403 

genetic effects 208, 222, 223-224, 
226-227, 230 
piglets, striped coats 45 
PigQTL database 124, 228, 340, 366, 477 
Pink-eyed dilution (P) locus 39, 46 
placental development 281-282 
Poland China breed 448, Plate 8 
polarity of embryonic cells 274, 277, 279-280 
polydactyly with otocephalic monster 62 
porcine endogenous retroviruses (PERVs) 

78-79, 253 

porcine stress syndrome (PSS) 59, 361-362 
PorcineSNP60 Beadchip 319, 377 
Porcula salvanius (pygmy hog, Sus salvanius) 

9, 149 

pork value chain 392-393, 404 
porphyria 62 

post-weaning feeding 203, 327-330 
see also performance traits 
Potamochoerus spp. 3, 5-6, 148 
PPAG genes (pregnancy-associated 
glycoproteins) 284-285 
PRE-1 (porcine repetitive element-1) 79-81 
pregnancy, maternal recognition of 282 
pregnancy-associated glycoproteins 

(PAGs) 284-285 

prenatal mortality 219, 226, 230, 270-271 
PRKAG3 gene 363-364, 370-371, 375, 376 

see also RN locus 

processing meat quality traits see meat quality 
production traits see carcass traits; feed intake; 

growth rate; meat quality 
profit function method 398 
progesterone receptor agonists (altrenogest) 243 
prolificacy see litter size 
pronuclear injection 246-247 
protamine-2 deficiency 63 
protease inhibitors 86 
protectiveness, maternal 206 
proto-oncogenes 277 

provenance of meat products 47, 373-374, 414 
PSE (pale, soft, exudative) meat 
condition 59, 361 
Pseudofam database 81 
pseudogenes 81, 86-87, 93, 107 
puberty 219, 225, 227, 228, 230 


correlation with performance traits 343-344 
pygmy hog (Sus salvanius or Porcula 
salvanius) 9, 149 


QTLs (quantitative trait loci) 

behavioural 201-202, 203-204, 206, 211 
expression QTLs (eQTLs) 373 
gene maps 152, 155 
health traits 109, 121-124 
imprinting 276, 412 
linkage with genetic markers 318-320, 
407-408, 412 

meat quality and carcass 
composition 365-373 
performance traits 340 
PigQTL database 124, 228, 340, 366, 477 
reproduction traits 228-232 


radiation hybrid (RH) mapping 153-154 
rarefaction (calculation of allelic richness) 314 
rebreeding 225, 227, 344 
recombination 
homologous 

in construction of linkage maps 151, 
152 

in genetic modification 248, 250 
site-specific 250-251 

somatic (immune system genes) 88-89, 102, 
105, 115-117 

red coat colour 42 

red river hog (Potamochoerus porcus) 5, 148 
Red Wattle breed 454 
reflectance of meat 357, 361 
renal disorders 60, 63 
reproduction/reproduction traits 218-232 

assisted 220, 242-245, 396-397 
behaviours associated with 204-205 
biology of 219-220 

gametogenesis 219, 264-266 
sex differentiation 64, 145, 289-292 
cloning 245, 247-248, 397 

correlations between reproduction 

traits 224-227 

correlations with other traits 227-228, 
342-346, 403 
genetic variation 220-224 
QTLs 228-232 
genomic studies 232 
transgenic pigs 252 
see also individual traits 
residual feed intake (RFI) 331, 333, 336, 
339-340 

see also feed intake 
respiratory distress syndrome 64 
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retinitis pigmentosa 250 

retinoic acid (RA) 286 

retroposons 78 

retrotransposons 77-81 

retroviruses 78-79, 253, 436 

reverse transcriptase 78, 79 

RFI see feed intake; residual feed intake 

RH (radiation hybrid) mapping 153-154 

ribosomal RNA (rRNA) 84-85, 92, 135-137, 275 

rickets 66 

RN locus (Rendement Napole) 59, 232, 357, 359, 
363-364, 369, 375 

RNA see microRNA; messenger RNA; non-coding 
RNA; ribosomal RNA; transfer RNA 
RNA-mediated transposons 77-81 
Roan (I Rn ) allele 42 

Robertsonian (rob) translocations 144, 147 
rolling behaviour 205 

RPSA pseudogene (ribosomal protein SA) 81, 87 
rRNA see ribosomal RNA 

RYR1 gene (ryanodine receptor 1) (HAL locus) 59, 
209-210, 361-363, 369, 374 

Saddleback breed 449 
SAGE see serial analysis of gene expression 
SC analysis see synaptonemal complex analysis 
SCH mapping see somatic cell hybrid mapping 
SCNT see somatic cell nuclear transfer 
segmentation 270, 285-286, 288-289 
selection indexes 325-326, 392, 398-400, 
410-411 

selective sweep 318 

semen 220, 244, 310-311, 315 

see also sperm 

sensory traits in meat quality 357, 358, 360, 

361,364 
QTLs 370-372 

serial analysis of gene expression (SAGE) 

182-183 

sex chromosomes 140, 290-292 
sex differentiation 64, 145, 289-293 
sex reversal 64, 145, 293 
sexing 

of embryos 76-77 
of sperm 396 
sexual behaviours 204-205 
SINEs (short interspersed nuclear elements) 78, 
79-81 

single nucleotide polymorphisms (SNFs) 
in genomic selection 414-415 
identification 191-192, 193 
as markers of genetic diversity 318-320 
in TLR genes 103-104 
sire lines 394, 397, 400-402 
skatole 359, 372 

skin diseases, pig as a model for 430, 435-436 


skin-on meat products 39 
SLA see swine leucocyte antigens 

SNP chips 154, 193, 319, 377 

SNPs see single nucleotide polymorphisms 
social behaviour 201-202, 210, 405 
societal concerns 201, 398-399, 405 
somatic cell hybrid (SCH) mapping 152-153 
somatic cell nuclear transfer (SCNT) 147, 245, 
247-248, 249, 251-252 
somatic hypermutation 102, 105, 117, 118 
somatic recombination 88-89, 102, 105, 

115-117 

sows 

dam lines 39, 394, 397, 402-403 
maternal behaviours 205-208 
reproduction traits 220, 221 , 222, 

223, 224 

correlations between 225-227 
crossbreeding 221, 222 
molecular genetics 228-231 
reproductive biology 219, 264-265 

synchronization of oestrus 243, 244 
spectratyping 117 
sperm 

cryopreservation 244, 310-311, 315 
cytogenetic analysis 139, 145-146, 147 
heritability of characteristics 222-223 
imprinting 276 
mRNA found in 267 
numbers in ejaculate 220, 224 
QTLs associated with 229 
short-tail 64 
sorting 396 

spermatogenesis 219, 265-266 
in transgenesis 247 

spots 

black spotting (E p ) allele 42-44 
Spotting (S) locus 39 
Spotted breed 448, Plate 9 
SRY gene 290-291, 292, 293 
stillborn piglets 219, 226, 230, 271 
stress response 208, 210-211 
stripes, in piglets 45 
Suidae 2-10, 147-150 
Suiformes 2 
Suinae 5-10 
Suini 3 

Sulawesi warty pig ( Sus celebensis ) 8, 10, 19, 
29-30, 149 
super-traits 476-477 
supply and distribution chain 392-393 
Sus genus 6, 9-10 

Sus barbatus (bearded pig) 8, 149 
Sus bucculentus (Heude’s pig) 4, 150 
Sus cebifrons (Visayan warty pig) 9, 149 
Sus celebensis (Sulawesi warty pig) 8, 10, 19, 
29-30, 149 


506 


Index 


Sus genus ( continued) 

Sus philippensis (Philippine warty pig) 8, 150 
Sus salvanius (pygmy hog, Porcula 

saluanius) 9, 149 

Sus scrofa (Eurasian wild boar) 6-7, 9, 10, 

148-149 

domestication 14-33 
Sus verrucosus (Javan warty pig) 8, 149 
Swabian-Hall Swine breed 451 
swine leucocyte antigens (SLA) (MHC antigens) 

86, 102, 105-110, 186 

Swine Testing and Genetic Evaluation System 

(STAGES) 400 

synaptonemal complex (SC) analysis 145-146 
syndactyly 64 

synteny, conserved 158-159, 189-190, Plate 2 
synteny maps 152-153 
synthetic (composite) breeds 394, 409-410, 
459-460 

systematics 1-10 

of domestic breeds 316, 317, 460, 461 


T cells 

adaptive immune response 106-107 
antigen receptors (TCRs) 89, 102, 105, 
117-121 

T-box genes 279, 286-288 
Homes 277, 280 
T (Brachyury) 278, 279, 285 
Tamworth breed 449, Plate 16 
tandem selection 410, 411 
tandemly repeated elements 81-85 
taxonomy 1-10 

of domestic breeds 316, 317, 460, 461 
Tayassuidae (peccaries) 2 
TCRs (T cell receptors) 89, 102, 105, 117-121 
teat number 227, 228 
telomerase 84 
telomeres 83-84, 137, 147 
tenderness of meat 357, 358, 361, 364, 371 
testis 

embryonic development 289-292 
size as a reproduction trait 224, 227, 229 
testosterone 223, 224, 227 
THA-banding 137 
thrombopathia 64 
Thuoc Nhieu breed 453 
Tibetan breed 453 

TLRs (Toll-like receptors) 103-104, 104 
TNFA/TNFB genes (tumour necrosis 
factor alpha/beta) 85 
Tongcheng breed 453, Plate 27 
tongue aplasia 54 
tonic immobility test 210 
toxicology research 433, 434 


traceability of meat products 47, 373-374, 414 
trait nomenclature 475-479 
transcription 87-88, 89 

embryonic (EGA) 274-275 
mitochondrial 92 

transcription factors 275, 279, 280-281, 
286-288, 291-292 
transcription start sites 184 
transgenics 243, 245-254 
agricultural uses 252-253 
biomedical uses 

disease models 246, 250, 253-254 
production of heterologous 
proteins 245-246 
xenotransplantation 253, 431 
methodologies 246-252 
translation, in mitochondria 92 
translocations 

reciprocal (rep) 140-143, 145-146 
Robertsonian (rob) 144, 147 
transplantation 78-79, 253, 431 
transposons 77-81 
tremor syndromes 65 
tRNA (transfer RNA) 85, 92 
trophoblast 280-281 
trophoectoderm 269, 277 
Turopolje breed 451 


UK, breeds originating in 449, 455-456, 
Plates 13-16 

uniformity at slaughter 406 
USA, breeds originating in 448, 455-456, 
Plates 4-10 


vaccination 110 

variation see genetic diversity 

vascular endothelial growth factor 

(VEGF) 284 

Vietnamese Pot Belly breed 453 
Visayan warty pig ( Sus cebifrons) 9, 149 
vitamin D-deficiency rickets 66 
von Willebrand disease 67 


warthogs ( Phacochoerus spp.) 3, 5, 148 
water-holding capacity of meat 361, 375 
wattles 67 

websites see Internet resources 
WeelB gene 273 

Weitzman method for analysis of genetic 
diversity 312-314 
welfare issues 201, 398-399, 405 
white coat colour 4, 39-42, 47, Plate 1 
White head (He) locus 46 
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wild boar ( Sus scrofa) 6-7, 9, 10, 148-149 
domestication 14-33 
woolly hair 67 

World Dictionary of Livestock Breeds 
Types and Varieties (Mason) 307 

X chromosome 140, 292 
xenotransplantation 78-79, 253, 431 
Xist gene 292 

Y chromosome 140, 290-291 


YAC vectors (yeast artificial chromosome) 154 
Yellow River Valley 24, 26-27 
yolk sac 281 

Yorkshire breed 448, Plate 10 
see also Large White breed 

ZAR1 gene 273 
zinc finger nucleases 

(ZFNs) 250 
zona pellucida 269 
zoo-FISH 158-159, Plate 2 
zygotes 267, 268, 273-274 



